Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1083 |
Symbol | |
ID | 4570027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1223760 |
End bp | 1225526 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765680 |
Product | Na+/solute symporter |
Protein accession | YP_911548 |
Protein GI | 119356904 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0991073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACAT TAACGCTTCT TGACTACTCG TTTATTGTCG GATACCTTCT CCTGACGCTG TTTATCGGCT TGTTGTTTTC AAAAAAAGCC TCTGAAAATG TTGGCGAATT TTTTCTTTCA GGCAGAAAAC TTCCCTGGTG GATTGCCGGA ACCGGTATGG TTGCAACAAC CTTTGCCGCT GACACGCCGC TTGCCGTAAC AGGATTGGTA GCAAAAAACG GCATTGCCGG AAACTGGGTC TGGTGGACGT TTGTCTCCGG AGGAATGCTG ACTGTTTTCT TTTTCGCAAG ACTCTGGCGC CGATCAAACA TCCTTACCGA CCTTGAATTT ATCGAAATTC GATACAGCGG TACCGCCGCC AAATTTCTTC GCGGATTCAA GGCGCTCTAT TTCGGACTTT TTATCAATTC GATCATTATC GGCTGGGTTA ATCTTGCGAT GTACAAGATA ATAAGGATTA TGGTTCCTGA ACTCAACCCC GAAATCACGA TCATAGCTCT TGTTGTTCTC ACGACCGTCT ACTCAGGACT TTCCGGGTTA TGGGGCGTTT CTATTACTGA CGCGGTGCAG TTCATTATCG CCATGACCGG CTGCATCATC CTTGCCGTTC TTGCACTGCA GGCACCTGAA GTTGGAGGCA TCTCCGGTTT ACAGCACGCA CTTCCAGCCT GGATGTTTGA CTTTTATCCC TCACTTTCCG GCTCCCGAGA AACGCCCGTT CAGGATAGCG GAGCGTTCTC GCTCCCCTTT GCATCGTTTG CCGCAATGGC ATTTGTCCAA TGGTGGGCGT CATGGTACCC TGGTTCCGAA CCGGGAGGCG GCGGCTATAT TGCCCAGCGA ATGATGAGTG CCAAAGATGA AAAGCACTCT CTTCTTGCGA CACTCTGGTT TACCGTTGCT CACTACTGTC TTCGTCCATG GCCATGGATC ATTGTCGCTC TTGCGAGTCT TGTCATGTTC CCCGACCTCC CCCTTGATCA GAAAGAGGAC GGATTTGTCT ATGTGATGAA AACCGTTCTG CCTTCGGGAC TGAAAGGACT GCTTGTGGCC GCATTTCTTG CCGCGTACAT GTCAACCCTT TCAACCCATC TCAACTGGGG AACAAGCTAC CTGATTAACG ATTTCTATCA ACGGTTTCTG AAGCCGGAGG CAGAAGCCGC GCATCTGGTA AAGGCGTCAA AAATCGTTAC CGGTCTGATC GCAATTTTTT CGCTTTTTAT CACGTTCTAT GTACTTAAAA CCATTACAGG GGCATGGGAA TTCATTATTC AATGTGGTGC CGGCACAGGT TTTGTACTGA TTTTCCGCTG GTTCTGGTGG CGACTGAACG CGTGGAGTGA AATCACCTCA ATGCTTGCAC CTTTTCTTGC CTATGCATGG ATTTCCTTTT TCACCTCAAT CACCTTCCCG GACTCATTGT TCATTATCGT CCTGTTTACA ATATCGTCAA CACTGATTGT AACATTTTTA ACCCCTCCGA CCGATACCGA CCGCTTGCAG TCATTTTACA GAACCACAAG GGTTGGCGGC ATTCTATGGA AAAAGATTTC CGTGACCATG CCGGAGGTTG AATCAGACAA GGGATTTATC ATGCTTTTCA TTGATTGGCT GCTCGGCATT ATCCTTGTTT ACGCCGCACT GTTCGGAACC GGAAAACTCA TCTTTGGAGA TCCAATGCAA GCCGTTATCT ACTTTGCAAC CGCCCTCGGT GCAGGAACGC TCATCTACAA AGACCTGAAC CGGCGAGGAT GGAACAATCT GAAATGA
|
Protein sequence | METLTLLDYS FIVGYLLLTL FIGLLFSKKA SENVGEFFLS GRKLPWWIAG TGMVATTFAA DTPLAVTGLV AKNGIAGNWV WWTFVSGGML TVFFFARLWR RSNILTDLEF IEIRYSGTAA KFLRGFKALY FGLFINSIII GWVNLAMYKI IRIMVPELNP EITIIALVVL TTVYSGLSGL WGVSITDAVQ FIIAMTGCII LAVLALQAPE VGGISGLQHA LPAWMFDFYP SLSGSRETPV QDSGAFSLPF ASFAAMAFVQ WWASWYPGSE PGGGGYIAQR MMSAKDEKHS LLATLWFTVA HYCLRPWPWI IVALASLVMF PDLPLDQKED GFVYVMKTVL PSGLKGLLVA AFLAAYMSTL STHLNWGTSY LINDFYQRFL KPEAEAAHLV KASKIVTGLI AIFSLFITFY VLKTITGAWE FIIQCGAGTG FVLIFRWFWW RLNAWSEITS MLAPFLAYAW ISFFTSITFP DSLFIIVLFT ISSTLIVTFL TPPTDTDRLQ SFYRTTRVGG ILWKKISVTM PEVESDKGFI MLFIDWLLGI ILVYAALFGT GKLIFGDPMQ AVIYFATALG AGTLIYKDLN RRGWNNLK
|
| |