Gene Acid345_3351 details


Gene Information

Locus tag: Acid345_3351
Symbol:
ID: 4071269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3974905
End bp: 3976362
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 60%
IMG OID: 637985373
Product: amino acid transporter
Protein accession: YP_592426
Protein GI: 94970378
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
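
The length fields can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3974905
END_BP = 3976362


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1458
print(protein_length(length_bp))  # 485
```

Both results agree with the Gene Length and Protein Length fields.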


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGGCAA TCATCAGCCC GACGCCGAAA TCAGCGCCGA GTAACACTCC GCAACTGGCG 
CGCGACCTGC GCGTAAGCCA CGCCACCGCC GTCGTCGTCG GCACCATCAT CGGCAGCGGC
ATTTTCCTCG TACCCGCCGA GATGATGCGC GCCGTCGGCA CCGCGAAGCT CGTCTATCTC
GCATGGATCG TCGGTGGCAT CCTGTCGTTC CTAGGCGCGC TGACGTATGC AGAACTCGGC
GCGATGAAGC CGCAATCCGG CGGCGAATAT GTGTACGTGC GCGATGCCTA CGGCCCGCTG
ATGAGCTTCC TCTATGCGTG GTCATGGTTC GTCATCGCGA AGCCCGGCTC CATGGCGACC
ATCGCAACCG GCATGATGCA GATTCTTGGC GGCTATCCCG CGCTGTCGTT CCTGCCAAAA
AACGTCGTCT CGGGAGTGCC ATTCACCTAC GCGCAGCTGG CGGCCGTAGC GCTCATCATT
TTCATCTCGG CCGTGAACTA CATCGGCGTG AAGAAAGCCG GACAGTTCCA GGTAGTCTTC
ACCGTCCTGA AGCTCGCAAT CATCTTCGGC GTGATTGTCG TTGGTTTCTT CGCGGGCCAC
GGCTCGTGGT CGAACTTTAC AACCAGCTTC ACGGGTGCGA CTGGCGGCAT TGCCGGCTTC
ATGATCGCGC TCGTTGCCGC GCTCTGGGCG TATGACGGCT GGAACGACAT CAACATGGTC
GCCGAGGAAA TCGACCACCC CGAGCGCAAC GTGCCGATCG CGCTGATTGT CGGCGTGGGC
ATCGTGGCAG CGTTGTACAT GCTCCTCAAC GCCGCAGTGC AATATGCGCT TCCAGCGCAG
GCCATCGCGA TGTCGAAGCG CGCGGCATCG GATGCAGTCC TGGTCTCAAT TGGCGCAGGC
GCGGCCTCGA TATTCGCGGC ACTCATGGCG ATCCAGATGT TGGCAACGAT CAACGGCACC
ACGCTCAGCG GCGCAAGAAT TCCGTATGCC CTGGCGCGCG ACGGCTATTT CTTCGAGGCC
ATCGGCAAAG TGCATCCGCG CTACCTCACG CCTGCAAATG CGATCGTCTT CCAGGGAGCG
CTGGCGGTCA TTCTCGTGTC GCTGGTCGGG AAATTCCAGC AGCTGTTCTC GTTAACAATC
TTCGCCGAGT GGCTGTTCTA CATGATCGCG ACGAGCACCG TCTTCGTCTT CCGGCGGCGC
GAGCCGAACG CGAATCGTCC CTACAAAACG TGGGGATATC CCGTCGTTCC CGCAGTCTTC
ATCGCTGCAG CCGCGATGCT GCTCTGCTAC ACCTTCGTCG ACAACCTGAA GTACGCCATG
ATTCCCACGA CGTTGATCGG TCCGCCGTTG AATTCAATTT CTACCGGCGG CGCGCTGGTA
ATTCTGCTCG GCGTGCCGGT GTTCTGGTGG TTCGCAAGGC AAAAATCTTT AACCACGAAG
GGCACGAAGG AATTGTAG
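
The gene sequence is shown in space-separated blocks of ten bases, so it needs cleaning before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage that can be compared with the GC content field. The helper names are hypothetical, and the rounding the database applies to its figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, as displayed.
first_line = "TTGGAGGCAA TCATCAGCCC GACGCCGAAA TCAGCGCCGA GTAACACTCC GCAACTGGCG"
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60
print(round(gc_percent(cleaned), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string of 1458 bases if the record is internally consistent.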
 
Protein sequence
MEAIISPTPK SAPSNTPQLA RDLRVSHATA VVVGTIIGSG IFLVPAEMMR AVGTAKLVYL 
AWIVGGILSF LGALTYAELG AMKPQSGGEY VYVRDAYGPL MSFLYAWSWF VIAKPGSMAT
IATGMMQILG GYPALSFLPK NVVSGVPFTY AQLAAVALII FISAVNYIGV KKAGQFQVVF
TVLKLAIIFG VIVVGFFAGH GSWSNFTTSF TGATGGIAGF MIALVAALWA YDGWNDINMV
AEEIDHPERN VPIALIVGVG IVAALYMLLN AAVQYALPAQ AIAMSKRAAS DAVLVSIGAG
AASIFAALMA IQMLATINGT TLSGARIPYA LARDGYFFEA IGKVHPRYLT PANAIVFQGA
LAVILVSLVG KFQQLFSLTI FAEWLFYMIA TSTVFVFRRR EPNANRPYKT WGYPVVPAVF
IAAAAMLLCY TFVDNLKYAM IPTTLIGPPL NSISTGGALV ILLGVPVFWW FARQKSLTTK
GTKEL
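
The protein sequence starts with M although the gene sequence starts with TTG, which normally encodes leucine. This is consistent with translation table 11, where TTG is one of several alternative start codons read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table. The start-codon set is taken from the NCBI definition of table 11 as I understand it and should be treated as an assumption; `translate_cds` is an illustrative name, not part of any library.

```python
from itertools import product

# Amino acid assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate_cds("TTGGAGGCAA TCATCAGCCC GACGCCGAAA"))  # MEAIISPTPK
print(translate_cds("GGCACGAAGG AATTGTAG"))               # GTKEL
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Note that the internal TTG in the second fragment is translated as L, since only the first codon is treated as an initiator.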