Gene Acid345_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3352 
Symbol 
ID4071270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3976319 
End bp3977758 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content61% 
IMG OID637985374 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_592427 
Protein GI94970379 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCTTCC GCAAGTCACA GCGCACCCTG AAAGATTATT TCCTCGCAGA CCGATCCATT 
CCCTGGTGGG CGCTGGCGCT CTCCATCGTC GCCGCCGAAA CCAGCACTCT GACCCTGGTC
AGCGTCCCGG GTCTCGCTTA CGACACCAAC TTCACCTTCC TGCAACTCGC CTTCGGATAC
CTCGTAGGGC GAGTCATCAT CTGCATCATC CTGCTCCCGC AATATTTCAA GGGATACCTG
TTCACCGCCT ACCAACTCAT CCAGCGTCGC TTCGGCGGCA AGCTGCGCAC CTACACCGCT
GGCCTCTTCC TCATCACCCG CGCCGCTGCC GAAGGCGTTC GCGTTTACGC TGTCGCGTTG
GTGATCGGTA TCGCCCTCGG CCCGCAACTC GGTAATTTCA GCGACTTCAG CCGCGACCTG
ATCGCCATCT CCATTGTCTG CATATTGACG CTCATCTATA CGTTCGAGGG AGGCCTCGCG
GCGGTGATCT GGACCGACGT CATCCAGACC TTCATCTACA TCGGCGGGAC GATCGTCGGT
TTCTTCACCA TCATCCATCT CGTCCCCGGC GGCTGGGGCG CGATTCACGC CACTGCCGCG
GCGGCAGGGA AGTTCAGGGT CTTCGACTTC AGCTGGGATT TCTGGAAGAC CTACACCTTC
TGGTCCGGCG TAATCGGTGG CGCGTTCCTG ACGACCGCGA GTCACGGCAC CGATCAACTC
ATCGTGCAGC GACTATTAGC GGCGCGCAGC CTGAAGGATT CACGCTGGGC GCTGCTGGCG
AGCGGCGTGG CTGTGCTTCT CCAGTTCGCG CTGTTCCTCT GCGTCGGCGC GATGCTCTGG
GTCTTCTACC ACGGCGTCGC CTTCGCCAAG ACGGACCGCA TCTTCCCCAC GTTCATTGTC
ACTGAGATGC CGCACCTCGT CTCCGGCCTG CTGATCGCCG CGATTTTAGC GGCTGCCATG
TCGAACCTAA GCGCGGCCCT GAATTCGCTC ACTTCGACGT CGATCATTGA TTTCTGGCTG
CGGCTCTATC CCAGTTCCAC CGAAAAAACG CGCATGCTAC TGTCTCGCGT CGGTACCATC
GCCTGGGGAC TCGTTCTCTT CTGCCTTGCC ATTCTTTCTC GACGCGGCGG TAAGGTCGTC
GAACTCGGCT TGACGATCGC TTCCGTCGCG CAGGGCGCCA TGCTCGGCAC GTTCCTGCTT
GGCGTGCTGA CCAAGCGCGC GAACCAACTG GGCACAGCGA TCGGCCTCAC CGCCGGCTTC
CTGGTGAACA TCTACATCTG GCAGGGAAAA GCGATTGTGC CGCAGTTTGC GCTGCCGCGC
GCGGTCCCGT TCCCGTGGTA TGTGGCCATC GGATCGTTTG TGACGTTCGC CATCGGTTAT
AGTGCGAGTC TACTTCTTGG AGGCAATCAT CAGCCCGACG CCGAAATCAG CGCCGAGTAA
 
Protein sequence
MRFRKSQRTL KDYFLADRSI PWWALALSIV AAETSTLTLV SVPGLAYDTN FTFLQLAFGY 
LVGRVIICII LLPQYFKGYL FTAYQLIQRR FGGKLRTYTA GLFLITRAAA EGVRVYAVAL
VIGIALGPQL GNFSDFSRDL IAISIVCILT LIYTFEGGLA AVIWTDVIQT FIYIGGTIVG
FFTIIHLVPG GWGAIHATAA AAGKFRVFDF SWDFWKTYTF WSGVIGGAFL TTASHGTDQL
IVQRLLAARS LKDSRWALLA SGVAVLLQFA LFLCVGAMLW VFYHGVAFAK TDRIFPTFIV
TEMPHLVSGL LIAAILAAAM SNLSAALNSL TSTSIIDFWL RLYPSSTEKT RMLLSRVGTI
AWGLVLFCLA ILSRRGGKVV ELGLTIASVA QGAMLGTFLL GVLTKRANQL GTAIGLTAGF
LVNIYIWQGK AIVPQFALPR AVPFPWYVAI GSFVTFAIGY SASLLLGGNH QPDAEISAE