Gene B21_01421 details

Gene Information

Locus tag: B21_01421
Symbol: ansP
ID: 8116494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1477955
End bp: 1479454
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 52%
IMG OID: 644847664
Product: hypothetical protein
Protein accession: YP_002999237
Protein GI: 251784933
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1113] Gamma-aminobutyrate permease and related permeases
TIGRFAM ID:
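
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1477955, 1479454))  # (1500, 499)
```

Under these assumptions the result agrees with the listed Gene Length (1500 bp) and Protein Length (499 aa).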


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00419609
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC ACGACACCGA CACTTCAGAT CAACACGCCG CGAAACGCCG CTGGCTTAAT 
GCCCACGAAG AGGGGTATCA CAAAGCGATG GGCAATCGCC AGGTACAGAT GATCGCCATT
GGCGGCGCGA TTGGCACCGG CTTGTTTTTA GGTGCAGGAG CCCGACTGCA AATGGCGGGT
CCAGCACTGG CACTGGTTTA TTTAATTTGT GGCTTGTTTT CGTTTTTTAT TCTGCGTGCA
TTGGGTGAGC TGGTGCTACA CCGCCCTTCC AGTGGCAGTT TTGTCTCTTA TGCCCGTGAG
TTTTTGGGTG AGAAAGCCGC TTATGTTGCT GGCTGGATGT ACTTCATCAA CTGGGCGATG
ACGGGGATTG TTGATATTAC CGCCGTCGCT CTGTATATGC ATTACTGGGG TGCGTTTGGC
GGCGTGCCGC AGTGGGTCTT TGCGCTCGCT GCACTTACCA TCGTTGGCAC CATGAATATG
ATCGGTGTGA AATGGTTTGC GGAGATGGAG TTCTGGTTTG CGCTTATTAA AGTGCTCGCC
ATTGTGACCT TTTTGGTCGT GGGTACAGTG TTCCTCGGTA GTGGTCAGCC GCTGGATGGC
AACACCACTG GCTTTCATTT AATCACCGAT AATGGCGGCT TCTTCCCCCA CGGTTTGCTG
CCTGCGCTGG TGTTGATTCA GGGCGTAGTG TTTGCTTTTG CCTCCATTGA AATGGTGGGT
ACAGCTGCCG GAGAATGTAA AGATCCGCAG ACCATGGTGC CTAAAGCCAT TAACAGCGTG
ATTTGGCGTA TTGGCCTGTT TTACGTCGGC TCCGTGGTGT TGCTGGTTAT GTTATTGCCG
TGGAGCGCGT ATCAGGCGGG GCAAAGTCCG TTCGTGACGT TTTTCTCTAA ACTGGGTGTG
CCATATATCG GCAGCATTAT GAACATTGTG GTGCTGACCG CTGCCCTCTC CAGCCTGAAC
TCAGGTCTGT ACTGCACCGG ACGTATTCTG CGCTCAATGG CGATGGGCGG TTCCGCACCG
AGTTTTATGG CGAAAATGAG TCGTCAGCAT GTGCCGTATG CCGGGATTCT GGCGACACTG
GTTGTGTATG TCGTCGGCGT ATTCCTCAAC TATCTGGTGC CGTCGCGCGT ATTTGAGATT
GTGTTGAACT TCGCGTCGCT GGGAATCATC GCTTCATGGG CGTTTATCAT CGTGTGCCAG
ATGCGCCTGC GTAAAGCAAT TAAACAAGGC AAAGCAGCGG ATGTCAGTTT TAAACTGCCT
GGCGCGCCCT TCACTTCCTG GCTGACATTA CTGTTTTTAC TGAGTGTCCT GGTGCTGATG
GCGTTCGATT ACCCGAACGG GACTTATACT ATCGCGGCGC TGCCGATTAT CGGTATTCTG
CTGGTTATAG GCTGGTTTGG TGTGCGCAAA CGCGTTGCTG AAATTCACAG CACTGCGCCA
GTCGTCGAGG AAGATGAAGA AAAACAGGAA ATTGTGTTTA AGCCTGAAAC GGCGAGCTAA
 
Protein sequence
MSKHDTDTSD QHAAKRRWLN AHEEGYHKAM GNRQVQMIAI GGAIGTGLFL GAGARLQMAG 
PALALVYLIC GLFSFFILRA LGELVLHRPS SGSFVSYARE FLGEKAAYVA GWMYFINWAM
TGIVDITAVA LYMHYWGAFG GVPQWVFALA ALTIVGTMNM IGVKWFAEME FWFALIKVLA
IVTFLVVGTV FLGSGQPLDG NTTGFHLITD NGGFFPHGLL PALVLIQGVV FAFASIEMVG
TAAGECKDPQ TMVPKAINSV IWRIGLFYVG SVVLLVMLLP WSAYQAGQSP FVTFFSKLGV
PYIGSIMNIV VLTAALSSLN SGLYCTGRIL RSMAMGGSAP SFMAKMSRQH VPYAGILATL
VVYVVGVFLN YLVPSRVFEI VLNFASLGII ASWAFIIVCQ MRLRKAIKQG KAADVSFKLP
GAPFTSWLTL LFLLSVLVLM AFDYPNGTYT IAALPIIGIL LVIGWFGVRK RVAEIHSTAP
VVEEDEEKQE IVFKPETAS
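
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that and is not the database's own tooling. It assumes the codon assignments of NCBI translation table 11 (identical to the standard code for amino acids; the two tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_block = "ATGAGTAAAC ACGACACCGA CACTTCAGAT CAACACGCCG CGAAACGCCG CTGGCTTAAT"
print(translate(first_block))  # MSKHDTDTSDQHAAKRRWLN
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.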