Gene Acid345_3114 details


Gene Information

Locus tag: Acid345_3114
Symbol:
ID: 4070228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3701019
End bp: 3702359
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 61%
IMG OID: 637985133
Product: amino acid transporter
Protein accession: YP_592189
Protein GI: 94970141
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
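
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start and end values copied from the record above.
length = gene_length_bp(3701019, 3702359)
print(length, protein_length_aa(length))  # prints: 1341 446
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.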


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.988661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAAA CCGCCGCGAA GACCGCGATG AAGCGCGCCA GCCTGCTGCC CTTCGTCTTC 
GTGATGTACT CGTACACCAC CGGCGGACCC TTCGGGCTCG AAGGCCAGGT CACCACCTCC
GGCCCCGGCA TGACGCTCAT CTACCACCTG CTGCTCCCGT TTTTCTGGTG CATCCCGGTC
TCGTTCGTCT CCGCTGAACT GACGACCGCG ATGCCCGTGG AAGGCGGCTT CTACCGCTGG
TCCCGCGCGG CCTTCGGAGA CTTCTGGGGC TTCCTCGCCG GATGGTGGAA CTGGTGCGCG
TCTTTCATTC TCGGCGGCGT GTACGCGGTC ATGTTCGCCG ACTACATGCA GTTCTATTTT
CCGCAACTCA AAGCACCGCT GGCACACTTT GCGGTTGCGC TCGCGATGAT CATAGTCATC
ACGTTCGTGA ACATCGTCGG CATTGATGCC GTCGGCAAAG TCGCTACTGT GTTTGGCGTA
TTGATTCTCG CTCCCATCGC CGTCATGTGT GTGTGGGGCG CGACGAAGTG GCAGCACAAT
CCATTCCTGC CATTGATTCC TCCGGGTGCG ACGCCGAAAC AAGTTGCCGG TGTCGGACTT
GCCCTCGGCC TCTGGCTCTA CTCCGGTTTC GAACAGCTCT CGACGGTTGC AGAAGAAGTC
GAAGACCCGC AGCGCACATT CCCACGCGCG CTCGCGTGGG CCGTGCCGAT GGCGATGGCC
ACCTACTTCC TCCCCACGCT CTTCTCGCTC GCCGCAGTCG GCGATTGGCA CGCATGGAAA
GACGGCTACT TCTCCACCGC CGCCTTCGCC ATCGGCGGAC ACTGGCTTGG CTTCGCCGTG
AACCTCGCTG CGTTGATCAC TGCTGTCTCG CTGCTGAATG GCACCGTGAT CGCTTCCACG
CGCATGCCCT TCGCCATGGC CGAAGACGGC TATCTCCCGC GTTTCCTGGC GAAAACCCAC
GCACGCTTTA AAACGCCGTG GCTCGCGATT ATTTGCTCGG CCTGTGTTTA TGCGGCGCTC
TCGTGGAAGA GTCTCTCGGC GCTCATCATT GTCTATTCGT GGCTGCGTGT TGCGACTACG
TGGATGACCG TCATCGCCGC GTGGCGACTG CGCGCGAAAG ATCCGAACAT GAAGCGGCCC
TTCCGCATTC CGTGGGGAAT CGCCGGTGTG GCGTATTGCG TGATTGCCCC GCTCATCATC
GGCGCCATCG CGCTTTCGGC CAGCGAAAAT CCCATCGGCG GATTGCTCTC GCTCGCCCTC
GGTCCTTTGA TGTATCCCGT AGTTAAGTTT TTTGCGCGCC GCGCCGCGCG CGCCGATCAA
GCCGCAGCGG CCATTAGCTG A
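
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch under that assumption: `gc_fraction` is a hypothetical helper, and how the database itself counts and rounds the value is not stated in the record. The helper ignores the spaces and line breaks that separate the 10-base blocks.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above; the full sequence is passed the same way.
first_line = "ATGAGTCAAA CCGCCGCGAA GACCGCGATG AAGCGCGCCA GCCTGCTGCC CTTCGTCTTC"
print(f"{gc_fraction(first_line):.0%}")
```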
 
Protein sequence
MSQTAAKTAM KRASLLPFVF VMYSYTTGGP FGLEGQVTTS GPGMTLIYHL LLPFFWCIPV 
SFVSAELTTA MPVEGGFYRW SRAAFGDFWG FLAGWWNWCA SFILGGVYAV MFADYMQFYF
PQLKAPLAHF AVALAMIIVI TFVNIVGIDA VGKVATVFGV LILAPIAVMC VWGATKWQHN
PFLPLIPPGA TPKQVAGVGL ALGLWLYSGF EQLSTVAEEV EDPQRTFPRA LAWAVPMAMA
TYFLPTLFSL AAVGDWHAWK DGYFSTAAFA IGGHWLGFAV NLAALITAVS LLNGTVIAST
RMPFAMAEDG YLPRFLAKTH ARFKTPWLAI ICSACVYAAL SWKSLSALII VYSWLRVATT
WMTVIAAWRL RAKDPNMKRP FRIPWGIAGV AYCVIAPLII GAIALSASEN PIGGLLSLAL
GPLMYPVVKF FARRAARADQ AAAAIS
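
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, not the database's own pipeline: `translate` is a hypothetical helper, and the codon table is the standard one, whose elongation assignments are shared by NCBI translation tables 1 and 11. Table 11 additionally permits alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this record.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces between 10-base blocks
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence above.
print(translate("ATGAGTCAAA CCGCCGCGAA GACCGCGATG"))  # prints: MSQTAAKTAM
print(translate("GCCGCAGCGG CCATTAGCTG A"))  # prints: AAAAIS
```

Both fragments match the corresponding ends of the protein sequence in the record.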