Gene Acid345_3333 details


Gene Information

Locus tag: Acid345_3333
Symbol:
ID: 4070295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3951418
End bp: 3952860
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 61%
IMG OID: 637985355
Product: amino acid transporter
Protein accession: YP_592408
Protein GI: 94970360
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
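
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3951418
end_bp = 3952860

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the 1443 bp and 480 aa reported in the record.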


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.326771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACA CTGAGCGCTC GCAGCATGAT GACGAGCACC TCGTCCGCGG GCTGAGCCTG 
GGCGGCGCGA CTGCGCTCAA TATGATCGAC GTGATCGGCA TTGGGCCGTT CATCACGATC
CCGCTGATCA TCAGTGCCAT GGGCGGCCCG CAGGCGATGC TGGGCTGGAT CTTCGGCGCG
CTGCTCTCGC TTTGCGACGG CCTCTGCTGG GCCGAACTCG GCGCCGCCAT GCCGGGCTCC
GGCGGCAGCT ATCGCTACCT CAACGAAATT TACGGTCGCC AGAAATGGGG ACGCCTGCTC
TCGTTCCTCT TCATCTGGCA GCTCTCGTTC TCGGCGCCGC TTTCCATCGC ATCAGGATGT
ATTGGCTTCT CGCAGTATGC GAGCTACTTG AAGCCGAGCC TCGAACACGC ATGGATCTCG
CATCCGCGCT TCATGGTTGG GCCGGTGACG GTGATGTCGA TTGCGACGGT CATCGTGGTC
GTCTTCCTGC TCTATCGCGG CGTAGTGCAG ATCGAAAAGA TCTCGAAATT TCTCTGGGTC
GGCGTGATGG GAACCATGGC GTGGATCATC TTCGCCGGGC TCACCCACTT CCAGCCGTCG
CGCGCGTTCG ATTTTCCGCC GGGCGCGTTT ACGCTCTCGC ACAATTTCTT TCTCGGTCTT
GGCGCGGCGA TGCTGATCGC TACCTACGAT TTCTGGGGCT ACTACAACAT CGCGTTTCTT
GGCGGCGAGG TACGCGACCC GGAGCGCAAT ATCCCGCGCG CGATGCTGTA TTCCATCGTG
ATTGTCGGCG TGCTCTACGT GGTGATGAAC ATCAGCATCC TCGGCGTGAT GCCGTGGCGC
GAGTTGGCGC AGACCGCGCA ATCGAACACG CGGTATTACA TCGTCGCGAC CATGATGGAG
CGCCTCTACG GCCACTGGGC CGGGGTGCTG GTGGCGCTGC TCATCATGTG GACGGCGTTT
GCCTCGGTGT TCTCGCTACT GCTGGGCTAT TCACGCGTTC CCTACGCGGC AGCGCGCGAT
GGCAACTACT TCAAGCCCTT TGCGCGCATC CATCCCACGC AGAAGTTTCC GACCGTGTCG
TTGCTGGTGC TGGGTGGCGT GGCGATTCTT TGCTGCTTCC TGCGACTAGC CGATGTCATC
GCGGCGCTGG TGGTGATTCG CATCCTGCTG CAGTTCGTGG TGCAGATCCT CGGGCTGCTG
TATTGGCGTT GGTCGCGGCC CGATGCGCCT CGTCCCTTCA AGATGTGGAT TTATCCCGTT
CCCGCAGTGC TGGCGCTGGT GGGCTTTATT TATGTACTAT TCGTGCGCAC CAATTCCTGG
CAACAAGTTC GCTATGCAGT CGTAATCGTT GTCATCGGTC TCGCCATCTA TCTTGTGCGC
GCCTGGCGCA GGGGCGAATG GCCGATGCCG GGAAGATCCG CGGCCAGCGA TGTCGCGGTG
TAA
 
Protein sequence
MSNTERSQHD DEHLVRGLSL GGATALNMID VIGIGPFITI PLIISAMGGP QAMLGWIFGA 
LLSLCDGLCW AELGAAMPGS GGSYRYLNEI YGRQKWGRLL SFLFIWQLSF SAPLSIASGC
IGFSQYASYL KPSLEHAWIS HPRFMVGPVT VMSIATVIVV VFLLYRGVVQ IEKISKFLWV
GVMGTMAWII FAGLTHFQPS RAFDFPPGAF TLSHNFFLGL GAAMLIATYD FWGYYNIAFL
GGEVRDPERN IPRAMLYSIV IVGVLYVVMN ISILGVMPWR ELAQTAQSNT RYYIVATMME
RLYGHWAGVL VALLIMWTAF ASVFSLLLGY SRVPYAAARD GNYFKPFARI HPTQKFPTVS
LLVLGGVAIL CCFLRLADVI AALVVIRILL QFVVQILGLL YWRWSRPDAP RPFKMWIYPV
PAVLALVGFI YVLFVRTNSW QQVRYAVVIV VIGLAIYLVR AWRRGEWPMP GRSAASDVAV
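
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence and comparing it with the start of the protein sequence. The following is a minimal sketch, not the database's own pipeline. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as this
# standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence and the first 20 residues of the protein,
# both copied from the record above.
first_row = "ATGAGCAACA CTGAGCGCTC GCAGCATGAT GACGAGCACC TCGTCCGCGG GCTGAGCCTG"
expected_start = clean("MSNTERSQHD DEHLVRGLSL")

print(translate(first_row) == expected_start)
print(round(gc_fraction(first_row), 3))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the complete protein sequence and with the GC content reported in the record.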