Gene Acid345_2672 details

Gene Information

Locus tag: Acid345_2672
Symbol:
ID: 4071926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3148305
End bp: 3150104
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 59%
IMG OID: 637984689
Product: Na+/solute symporter
Protein accession: YP_591747
Protein GI: 94969699
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
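
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported numbers but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3148305
end_bp = 3150104
reported_gene_length = 1800    # bp
reported_protein_length = 599  # aa


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    return gene_length_bp // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

# Both comparisons print True when the record is internally consistent.
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```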


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTTA CGCTCATTGA TTGGATCGTC ATTGTCGCTT ATTTCAGCGT CAATCTCCTG 
ATCGGGTTCT ACTACGCCCG CAAAGCCGGC AGTTCCGTGG ATGAATTCTT TATCTCGGGA
CGCGAGGTGT CGTGGTGGCT GGCAGGAACT TCGATGGTGG CGACGACGTT TGGCGCCGAT
ACCCCGCTGG TGGTGACGGG GATGATCTTC AAGTACGGCA TCGCCGGCAA CTGGCTGTGG
TGGAACCTGG CGCTGAGCGG GATGCTGACG GTCTTCTTCT TCGCGCGATT GTGGCGGCGC
TCGGGCGTGC TCACCGACAT GGAGTTTGCC GAGCTTCGCT ATGCAGGGAA GCCCGCAGCG
TTCCTGCGCG GGTTCCGCGC GCTGTATCTC GCGTTGCCGG TGAACACGAT CATCATGGGT
TGGGTGAACC TGGCGATGGC GAAGGTGCTG GAACTCACGC TGGGCATCCA CAAGATGAAC
GCGGTGATGT TCTGCCTGGC GATGACGCTG CTGTATGCGG CGATCTCGGG ATTGTGGGCG
GTGCTGTGGA CGGACCTGCT GCAGTTCATT CTGAAGATGA GCATGGTGAT TGCGCTCGCC
GTTTTCGCGG TGAAGGCGAT TGGCGGCATT GGAGCGATCA AGACGGCGCT GGCGACGCAG
CATCCGACGA TTGGGACCTC GTACCTGGCA TTTGTGCCGG ATTGGACGTC GGGATGGATG
GTGATGTTCC TCGTCTATCT CAGTATTAAC TGGTGGGCGA GCTGGTATCC GGGGGCAGAG
CCGGGCGGCG GCGGATATAT CGCGCAACGA ATTTTCTCCG CGAAGGACGA GAAGAACTCG
CTGGGCGCAA CGTTGTGGTT CAACTTCGCG CATTACGCAT TGCGTCCGTG GCCGTGGATT
CTGGCGGCGT TGGTGGCGGT GGTGATGTTC CCGGGGTTGA AAGATCCCGA GACCGGATAC
ATCAAGGTGA TGATCGCGTA TTTGCCGCCT AGCCTGCGCG GGCTGATGCT GGCAGGATTT
GCGGCGGCGT ATATGTCCAC GATCGGCACG CACATAAACC TTGGCGCGTC GTACTTGATC
AACGACTTCT ATCGGCGTTT CATGAAGCCA AGCGAGAGTG AGAAGCATTA CGTGGTGGCG
TCGAGACTGG CGACGGTCTT CGTGACCGTG CTCTCGGCGG TGGCAACGTA CTACATGCAT
TCGATTGAGG GCGCGTGGAA GTTTTTGATC TCGATAGGCG CAGGCGCGGG ACTGGTGTTC
ATGCTGCGCT GGTTCTGGTG GCGCATCAAC GCGTGGAGCG AAGTGGGGGC AATGACGGCG
GCGGCAACTT CGTCGCTGTT CCTGCAATCG CGGTTCGCGA CCGGCGTGGT GGAGATCTTC
CGGCGCTTCG ATCCGAAGCT TGATCCGGGT CCGCTCGACA GCAGCACCCC GCATGGATTT
GCGTGGGTAA TGCTGCTGAC GACCGGGATT ACGACAATCA GCTGGCTGGT AGTGACCTTC
CTAACGAAGC CGGAGCCGGA AGCGAAGCTA CGCGAGTTCT ATCGCAAGGT GCAGCCCTCC
GCATTTGGGT GGCGGCGAAT TGCGGAACTC GAGGGTGAAA CCTCGAAGCA GAGCCTGCTG
TGGTCGGCCG TGGATTGGGT GATGGGTTGC GGCATGATCT ATTGCTCGCT GTTTGGGATT
GGGCGATTGA TCTTCGGGCC GGTGTGGCAA GGGTTTGTGC TGCTGGCGAT TGCGGCGTTC
TGCTTGTGGT TCTTGTTCTG GGATTTGAAC CGAAGAGGGT GGGATACGCT GAGTAGCTAA
 
Protein sequence
MQLTLIDWIV IVAYFSVNLL IGFYYARKAG SSVDEFFISG REVSWWLAGT SMVATTFGAD 
TPLVVTGMIF KYGIAGNWLW WNLALSGMLT VFFFARLWRR SGVLTDMEFA ELRYAGKPAA
FLRGFRALYL ALPVNTIIMG WVNLAMAKVL ELTLGIHKMN AVMFCLAMTL LYAAISGLWA
VLWTDLLQFI LKMSMVIALA VFAVKAIGGI GAIKTALATQ HPTIGTSYLA FVPDWTSGWM
VMFLVYLSIN WWASWYPGAE PGGGGYIAQR IFSAKDEKNS LGATLWFNFA HYALRPWPWI
LAALVAVVMF PGLKDPETGY IKVMIAYLPP SLRGLMLAGF AAAYMSTIGT HINLGASYLI
NDFYRRFMKP SESEKHYVVA SRLATVFVTV LSAVATYYMH SIEGAWKFLI SIGAGAGLVF
MLRWFWWRIN AWSEVGAMTA AATSSLFLQS RFATGVVEIF RRFDPKLDPG PLDSSTPHGF
AWVMLLTTGI TTISWLVVTF LTKPEPEAKL REFYRKVQPS AFGWRRIAEL EGETSKQSLL
WSAVDWVMGC GMIYCSLFGI GRLIFGPVWQ GFVLLAIAAF CLWFLFWDLN RRGWDTLSS
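
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the block spacing, computes GC content, and translates nucleotides to amino acids. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are illustrative and not part of any database tool.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record (60 nt).
first_line = clean(
    "ATGCAACTTA CGCTCATTGA TTGGATCGTC ATTGTCGCTT ATTTCAGCGT CAATCTCCTG"
)
print(translate(first_line))  # MQLTLIDWIVIVAYFSVNLL
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would allow the reported GC content and protein length to be checked in the same way.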