Gene Acid345_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3036 
Symbol 
ID4071943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3603262 
End bp3604656 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content61% 
IMG OID637985055 
Productamino acid transporter 
Protein accessionYP_592111 
Protein GI94970063 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0805799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGACCG CGTCGTCCCC GATGAGCGCT TCTCCTGCCC CGATCGACCC CCATCATCTC 
CCCAGCGAAA AACGCATTCA TCTCCTCGCC CTTGCCGCCA TCATTTTCTT CACCACGTGC
GGCGGAGCCT TCGGACTCGA GCCGCTCATC GGCGCTGTTG GCCCTGCGCT CTCGCTCGTC
TTCATCCTTG TCACTCCGCT GCTCTGGAGC CTCCCAACCG CATTGATGGT CGCCGAACTC
ACCGCGATGA TGCCCGAAGA AGGCGGCTTC TATGTCTGGA TCCGCGAAAC GTTTGGCTCG
CTTTGGGCCG TGCAACAGGC CTGCTGGACG ATGACCATCT CCGTCATCTG GCTGGCGATG
TATCCCATCC TCTTCGTCGG CTATCTCGGA TTCCTAATCC CGGAGATCGC CGCACCCGCG
CACCCGTTCC TCCGCTGGGG AATTACGGGC CTCATGATTG CTAGTGGGCT GCTATTGAAT
CTCCGCGGTT CACATACCGT CGGAGGCGCC GCCCAGATCG TCACCAGCAT CGTGCTCGGC
ACCTTCGTCG TCATGCTCAT TACATGGCTG GCGCGCCTCC ATAATCCCCG ACTCATCCCT
GGCATCCTTC ATCGCGATAT CCGCACACCG CATCCCGGCG CGCTGCTGCT CGGGATCTCC
TTCACAGTCT TCAATTATTC CAGTTGGGAT AGCGTCTCAA CCTACGCTGG CGAAGTCGAT
CAGCCGCAGC GCAATTACCC GCGCGCCATC ATCTATGCGC TCGCCCTCAC CGTGCTTTGC
TATTTGATTC CGGTGGCCGC TGGCATCACC GTTACCACTG ACGCCAACAT CTGGAGCAGC
GACCAAGGCT GGCCGGTAAT CGCGCGCCTC ATCGGCGGAA CCTGGCTGGG TACACTCATG
GCCGGCGCGG GCCTTGCTTC GATCTGGGGA TTGTTCAACG GCCAGCTTCT CTACGTCTCG
CGCCTTCCGT ATGCGCTCGC CCGCGACGGA TGGCTGCCTA AGATTTTCGC GAAAACTTCC
ACCGACACCG CCCCGCCTCG TGCCGCGCTC TTCGCTTTTT GCGGCATCAC CGCACTCTTC
ACCGCGTTCT CTCTCGGGTC CCTGGCGATC ATCCAATGCG TGCTTTACTG TGCTGCGCTT
ACCCTCGATT TTCTCGCTCT CTTCATGCTC CGGATTCGAC GTCCGCACGC CGAGCGCAGC
TTCAGCGTGC CCGGTGGCTG GCTCGGCATC GCCTACGTCT GCGTTTCTCC GTTTATCTTT
GCGTTGTTTG TGCTGTATGC CGGTCTGCGC GACTGGCGAG CCTATCCGGG ACAACTTCTC
GTCATCCCGC TAGTCACCGC CGCTGGTCTT TGCCTGTACT ATTCCCGGCG CACTCGAGCT
AGTGCCGGTT CATGA
 
Protein sequence
MTTASSPMSA SPAPIDPHHL PSEKRIHLLA LAAIIFFTTC GGAFGLEPLI GAVGPALSLV 
FILVTPLLWS LPTALMVAEL TAMMPEEGGF YVWIRETFGS LWAVQQACWT MTISVIWLAM
YPILFVGYLG FLIPEIAAPA HPFLRWGITG LMIASGLLLN LRGSHTVGGA AQIVTSIVLG
TFVVMLITWL ARLHNPRLIP GILHRDIRTP HPGALLLGIS FTVFNYSSWD SVSTYAGEVD
QPQRNYPRAI IYALALTVLC YLIPVAAGIT VTTDANIWSS DQGWPVIARL IGGTWLGTLM
AGAGLASIWG LFNGQLLYVS RLPYALARDG WLPKIFAKTS TDTAPPRAAL FAFCGITALF
TAFSLGSLAI IQCVLYCAAL TLDFLALFML RIRRPHAERS FSVPGGWLGI AYVCVSPFIF
ALFVLYAGLR DWRAYPGQLL VIPLVTAAGL CLYYSRRTRA SAGS