Gene B21_00110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00110 
SymbolaroP 
ID8115388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp122983 
End bp124356 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID644846404 
Producthypothetical protein 
Protein accessionYP_002997977 
Protein GI251783673 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00481643 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGAAG GTCAACAGCA CGGCGAGCAG CTAAAGCGCG GCCTTAAAAA CCGCCATATT 
CAGCTTATCG CGCTGGGTGG CGCGATAGGG ACCGGGTTAT TCCTGGGTAG CGCCTCCGTA
ATACAGTCCG CAGGGCCAGG GATTATCCTG GGTTACGCCA TTGCTGGTTT TATCGCCTTT
CTGATCATGC GTCAGCTGGG TGAAATGGTG GTCGAAGAAC CTGTCGCAGG CTCCTTTAGC
CACTTTGCTT ATAAATACTG GGGCAGTTTT GCCGGTTTCG CCTCTGGCTG GAACTACTGG
GTACTGTACG TTTTAGTTGC CATGGCTGAG CTGACTGCCG TGGGTAAATA CATTCAGTTC
TGGTATCCGG AAATCCCCAC CTGGGTTTCT GCCGCCGTAT TCTTTGTGGT GATTAACGCC
ATCAACCTGA CCAACGTTAA AGTGTTTGGC GAGATGGAGT TCTGGTTTGC CATTATCAAA
GTTATCGCGG TGGTAGCGAT GATCATCTTC GGCGGCTGGC TGCTATTCAG TGGCAACGGC
GGCCCGCAGG CGACCGTTAG CAACCTGTGG GATCAGGGTG GTTTCCTGCC GCACGGCTTC
ACCGGGCTGG TGATGATGAT GGCGATTATC ATGTTCTCGT TCGGTGGTCT GGAACTGGTG
GGGATCACCG CAGCAGAAGC TGATAACCCG GAGCAAAGTA TACCGAAAGC AACTAACCAG
GTTATCTACC GCATCCTGAT TTTCTATATT GGTTCGTTAG CCGTTCTGCT CTCACTGATG
CCGTGGACCC GCGTTACCGC CGATACCAGT CCGTTTGTGC TGATCTTCCA CGAGTTAGGC
GATACCTTTG TGGCGAATGC GCTGAACATC GTGGTACTGA CTGCGGCGCT CTCCGTGTAC
AACAGCTGCG TATATTGCAA CAGCCGTATG CTGTTTGGTC TGGCACAACA GGGTAATGCG
CCAAAAGCGC TGGCGTCTGT CGATAAACGT GGTGTACCAG TAAATACCAT TCTGGTGTCT
GCACTGGTAA CGGCGTTGTG CGTACTGATT AACTACCTTG CCCCAGAGTC CGCTTTCGGA
CTGTTAATGG CGCTGGTGGT ATCTGCACTG GTAATCAACT GGGCGATGAT TAGCCTGGCG
CATATGAAAT TCCGTCGCGC CAAGCAGGAA CAAGGCGTGG TAACTCGCTT CCCTGCTCTG
CTTTATCCGC TGGGTAACTG GATCTGCCTG CTGTTTATGG CGGCGGTACT GGTGATTATG
CTGATGACCC CAGGAATGGC GATTTCGGTA TACCTGATCC CGGTATGGCT GATCGTGTTA
GGTATCGGCT ATCTGTTTAA AGAGAAAACC GCCAAAGCCG TAAAAGCGCA TTAA
 
Protein sequence
MMEGQQHGEQ LKRGLKNRHI QLIALGGAIG TGLFLGSASV IQSAGPGIIL GYAIAGFIAF 
LIMRQLGEMV VEEPVAGSFS HFAYKYWGSF AGFASGWNYW VLYVLVAMAE LTAVGKYIQF
WYPEIPTWVS AAVFFVVINA INLTNVKVFG EMEFWFAIIK VIAVVAMIIF GGWLLFSGNG
GPQATVSNLW DQGGFLPHGF TGLVMMMAII MFSFGGLELV GITAAEADNP EQSIPKATNQ
VIYRILIFYI GSLAVLLSLM PWTRVTADTS PFVLIFHELG DTFVANALNI VVLTAALSVY
NSCVYCNSRM LFGLAQQGNA PKALASVDKR GVPVNTILVS ALVTALCVLI NYLAPESAFG
LLMALVVSAL VINWAMISLA HMKFRRAKQE QGVVTRFPAL LYPLGNWICL LFMAAVLVIM
LMTPGMAISV YLIPVWLIVL GIGYLFKEKT AKAVKAH