Gene Acid345_3623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3623 
Symbol 
ID4070143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4288990 
End bp4290216 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content61% 
IMG OID637985646 
Productcitrate transporter 
Protein accessionYP_592698 
Protein GI94970650 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.893629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.562727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCG CGCTGACTCA CACCCTCGCC TGGGCGATAT TCCTGATCAG CTACTTCGTA 
TTCGCCGTGG GGCGACTGCC GGGGACGCGC CTGGATCGCG CGGGGATGGC GTTTATCGGC
GCGGTGGGAA TGTTTGTCAT CGGCGCCCTG AACGCGAAGA CCGCGATCGC ATCGCTCGAC
TACGAGACGC TGGTGCTGCT GGCCTCGATG ATGGTGCTGG TAGCGGTGCT GCACCTCGAG
GGCTTCTTCG AGTGGGTCAC ACGGCAGATC GTGGCACGAC TGGCGCCGGG CCAATTGTTG
CCGGGAGTGA TTTTTTCGGC GGGCGTATTG TCGGCGTTCC TGGTCAACGA TGTCGTGTGC
TTATTCATGG CGCCGCTGAT TCTGCGGGTG ACGAAACAGA TGGGACGCAA TCCGCTGCCG
TTCTTGCTGG CACTGGCGAC GGCTTCGAAC ATCGGAAGTT CCGCGACCAT CACCGGCAAC
CCGCAGAACA TTTTGATCGG GTCGGTTTCG CAGATCGGCT ACCGCGATTT TCTGTTTCAC
CTCGGGCCGG TGGCCGTGGT GGGAATGTTT CTGGATTGGG CTGTGATTGC CGTGCTGTGC
CGGAAGCAAC TGAGCGAGCG ATTGCCGATT GAGGCGAACC TCGAGGCGAA GCAGGGATTG
GATGGGCTGC TGGCGCCGCT GCTCATCGCG GGCGGGATCA TCGCGGCGTT CCTCGGGGGC
TTGAATCCAG CGCTGGTGGC GGCGACGGGC GCTGCCGTGC TGCTGCTGTT GCGCTCGCGC
TTGCTGGAGA AGATCTATCG CGAGGTGGAC TGGGCGTTGC TCGTGCTATT CATCGGGCTG
TTCCTCATCA TCGGAGCTGC GGAGCAGACA GGCATAGCAG CGACATTGCT GCGCGCGGCG
GAATGGATGA ACCTGCATAA TCTCGGGATC TTCAGCGTTA CCGTGACGCT GCTGTCGAAC
ATGGTTAGCA ATGTTCCGGC GGTCATGCTG CTGAAGGACC TGGTGAAGCA GTTCCCTAAC
GCGCATCAAT TCTGGCTGGC GTTGGCGATG GCGAGCACGC TGGCGGGTAA CCTGACGATC
ACCGGCTCAA TCGCGAACAT GATTGTGGTC GAATCGGTCC GGCCACAACT GCGGATTACG
TTCAAGGACT ACCTCGTGGT CGGCGTTCCC ACTACAATCC TTACCATTGC GGTGGGAACG
CTGTGGATTG CGTTCTTCGC GCACTAG
 
Protein sequence
MSIALTHTLA WAIFLISYFV FAVGRLPGTR LDRAGMAFIG AVGMFVIGAL NAKTAIASLD 
YETLVLLASM MVLVAVLHLE GFFEWVTRQI VARLAPGQLL PGVIFSAGVL SAFLVNDVVC
LFMAPLILRV TKQMGRNPLP FLLALATASN IGSSATITGN PQNILIGSVS QIGYRDFLFH
LGPVAVVGMF LDWAVIAVLC RKQLSERLPI EANLEAKQGL DGLLAPLLIA GGIIAAFLGG
LNPALVAATG AAVLLLLRSR LLEKIYREVD WALLVLFIGL FLIIGAAEQT GIAATLLRAA
EWMNLHNLGI FSVTVTLLSN MVSNVPAVML LKDLVKQFPN AHQFWLALAM ASTLAGNLTI
TGSIANMIVV ESVRPQLRIT FKDYLVVGVP TTILTIAVGT LWIAFFAH