Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2672 |
Symbol | |
ID | 4071926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3148305 |
End bp | 3150104 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984689 |
Product | Na+/solute symporter |
Protein accession | YP_591747 |
Protein GI | 94969699 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTTA CGCTCATTGA TTGGATCGTC ATTGTCGCTT ATTTCAGCGT CAATCTCCTG ATCGGGTTCT ACTACGCCCG CAAAGCCGGC AGTTCCGTGG ATGAATTCTT TATCTCGGGA CGCGAGGTGT CGTGGTGGCT GGCAGGAACT TCGATGGTGG CGACGACGTT TGGCGCCGAT ACCCCGCTGG TGGTGACGGG GATGATCTTC AAGTACGGCA TCGCCGGCAA CTGGCTGTGG TGGAACCTGG CGCTGAGCGG GATGCTGACG GTCTTCTTCT TCGCGCGATT GTGGCGGCGC TCGGGCGTGC TCACCGACAT GGAGTTTGCC GAGCTTCGCT ATGCAGGGAA GCCCGCAGCG TTCCTGCGCG GGTTCCGCGC GCTGTATCTC GCGTTGCCGG TGAACACGAT CATCATGGGT TGGGTGAACC TGGCGATGGC GAAGGTGCTG GAACTCACGC TGGGCATCCA CAAGATGAAC GCGGTGATGT TCTGCCTGGC GATGACGCTG CTGTATGCGG CGATCTCGGG ATTGTGGGCG GTGCTGTGGA CGGACCTGCT GCAGTTCATT CTGAAGATGA GCATGGTGAT TGCGCTCGCC GTTTTCGCGG TGAAGGCGAT TGGCGGCATT GGAGCGATCA AGACGGCGCT GGCGACGCAG CATCCGACGA TTGGGACCTC GTACCTGGCA TTTGTGCCGG ATTGGACGTC GGGATGGATG GTGATGTTCC TCGTCTATCT CAGTATTAAC TGGTGGGCGA GCTGGTATCC GGGGGCAGAG CCGGGCGGCG GCGGATATAT CGCGCAACGA ATTTTCTCCG CGAAGGACGA GAAGAACTCG CTGGGCGCAA CGTTGTGGTT CAACTTCGCG CATTACGCAT TGCGTCCGTG GCCGTGGATT CTGGCGGCGT TGGTGGCGGT GGTGATGTTC CCGGGGTTGA AAGATCCCGA GACCGGATAC ATCAAGGTGA TGATCGCGTA TTTGCCGCCT AGCCTGCGCG GGCTGATGCT GGCAGGATTT GCGGCGGCGT ATATGTCCAC GATCGGCACG CACATAAACC TTGGCGCGTC GTACTTGATC AACGACTTCT ATCGGCGTTT CATGAAGCCA AGCGAGAGTG AGAAGCATTA CGTGGTGGCG TCGAGACTGG CGACGGTCTT CGTGACCGTG CTCTCGGCGG TGGCAACGTA CTACATGCAT TCGATTGAGG GCGCGTGGAA GTTTTTGATC TCGATAGGCG CAGGCGCGGG ACTGGTGTTC ATGCTGCGCT GGTTCTGGTG GCGCATCAAC GCGTGGAGCG AAGTGGGGGC AATGACGGCG GCGGCAACTT CGTCGCTGTT CCTGCAATCG CGGTTCGCGA CCGGCGTGGT GGAGATCTTC CGGCGCTTCG ATCCGAAGCT TGATCCGGGT CCGCTCGACA GCAGCACCCC GCATGGATTT GCGTGGGTAA TGCTGCTGAC GACCGGGATT ACGACAATCA GCTGGCTGGT AGTGACCTTC CTAACGAAGC CGGAGCCGGA AGCGAAGCTA CGCGAGTTCT ATCGCAAGGT GCAGCCCTCC GCATTTGGGT GGCGGCGAAT TGCGGAACTC GAGGGTGAAA CCTCGAAGCA GAGCCTGCTG TGGTCGGCCG TGGATTGGGT GATGGGTTGC GGCATGATCT ATTGCTCGCT GTTTGGGATT GGGCGATTGA TCTTCGGGCC GGTGTGGCAA GGGTTTGTGC TGCTGGCGAT TGCGGCGTTC TGCTTGTGGT TCTTGTTCTG GGATTTGAAC CGAAGAGGGT GGGATACGCT GAGTAGCTAA
|
Protein sequence | MQLTLIDWIV IVAYFSVNLL IGFYYARKAG SSVDEFFISG REVSWWLAGT SMVATTFGAD TPLVVTGMIF KYGIAGNWLW WNLALSGMLT VFFFARLWRR SGVLTDMEFA ELRYAGKPAA FLRGFRALYL ALPVNTIIMG WVNLAMAKVL ELTLGIHKMN AVMFCLAMTL LYAAISGLWA VLWTDLLQFI LKMSMVIALA VFAVKAIGGI GAIKTALATQ HPTIGTSYLA FVPDWTSGWM VMFLVYLSIN WWASWYPGAE PGGGGYIAQR IFSAKDEKNS LGATLWFNFA HYALRPWPWI LAALVAVVMF PGLKDPETGY IKVMIAYLPP SLRGLMLAGF AAAYMSTIGT HINLGASYLI NDFYRRFMKP SESEKHYVVA SRLATVFVTV LSAVATYYMH SIEGAWKFLI SIGAGAGLVF MLRWFWWRIN AWSEVGAMTA AATSSLFLQS RFATGVVEIF RRFDPKLDPG PLDSSTPHGF AWVMLLTTGI TTISWLVVTF LTKPEPEAKL REFYRKVQPS AFGWRRIAEL EGETSKQSLL WSAVDWVMGC GMIYCSLFGI GRLIFGPVWQ GFVLLAIAAF CLWFLFWDLN RRGWDTLSS
|
| |