Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1120 |
Symbol | |
ID | 6374795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1204916 |
End bp | 1206679 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642683622 |
Product | Na+/solute symporter |
Protein accession | YP_001959539 |
Protein GI | 189500069 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.13754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.460734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAGC TTACCTCTCT TGATTTCAGC ATCATCGCCG GCTATCTGGT GCTGACCCTG CTTATCGGGC TCTTTTTTTC AAAAAGGGCT TCTCAGAACG TCGGCGAGTT CTTTCTTTCG GGACGTAAGC TGCCCTGGTG GATTGCCGGA ACCGGTATGG TCGCCACTAC TTTCGCCGCC GATACGCCGC TTGCCGTTAC AGGCCTTGTG GCCAAACACG GCATTGCGGG AAACTGGCTC TGGTGGACGT TTGTATCAGG AGGAATGCTC ACCGTGTTTT TCTTTGCACG ACTATGGCGA AGGGCAAACA TTCTCACCGA CCTTGAATTT ATCGAGCTGC GTTACAGCGG CAAACCCGCA CAGTTTCTCC GGGGCTTCAA GGCTCTCTAC TTCGGGCTCT TTATCAATGC GGTGATCATC GGCTGGGTGA ACCTGGCCAT GTACAAGATC ATCAGAATCA TGCTTCCGGA ACTGAATCCT GAAATATCCA TTGTTGCCTG CGTTATGCTG ACGACCCTTT ATTCAGGTCT TTCAGGCCTC TGGGGCGTAA CCATAACAGA CATGGTACAA TTCGTCATAT CCATGACGGG ATGTATTATT CTGGCCATCC TGGCGCTTCA GGCTCCGGAA ATCACACAGG CGGGGGGCAT AACAAACGCC CTGCCGGAAT GGATGTTCAG CTTTTTCCCC TCGATCACCG CAAACCCGTC CGCCGAAGGA TCAGGAGGCA CTCTTGAACT TACCTTTGCC GCGTTTGCCG CATTCGCTTT CATACAGTGG TGGGCTTCCT GGTATCCCGG CTCGGAACCG GGGGGCGGGG GCTACATCGC TCAACGCATG ATGAGCGCCA AAGACGAAAA GCATTCTCTG CTTGCCACGC TATGGTTCAT TATCGCGCAC TACTGTCTGA GACCCTGGCC ATGGATCATT ATTGGACTCG CGAGTCTGGT GCTTTTCCCT GACCTGCCTG CCGACCAGAA AGAAGACGGT TTTGTCTATG TCATGCAGTC CCTTCTTCCG CCGGGCCTCA AAGGGCTGCT GATAGCCGCT TTTCTGGCCG CGTATATGTC CACCCTCTCA ACGCACCTCA ACTGGGGGAC AAGCTATCTG GTCAATGATT TCTATAAACG CTTTATCAAA ACGGAAGCCT CTTCGGCTCA CTACGTTACC ATTGCAAAGG TGTTCACCGC ATGCGTTGCG GTTTTTTCTC TATTCATAAC CTTCTTTGTG CTGGAAACCA TCACCGGCGC CTGGGAATTC ATTATCCAGT GCGGAGCCGG CACAGGATTC GTACTCATCC TCCGCTGGTA CTGGTGGAGA CTCAACGCAT GGTCGGAGAT CGTTTCCATG ATCGCGCCGT TTGCCGCATA CGCCTGGCTT GTTCTGTATA CCGACATCAC TTTCCCGGGC TCTATCTACC TTATCGTTCT CTTTACCATA GCAGCAACGC TGCTTGTCAC CTATGCAACC CCGGCTACGG ACGAGAAACA GCTTCAGAGC TTCTACTCGG TCACCAGGGT CGGAGGATTT TTCTGGAAAA AAATATCCGA CCAGATGCCG GACGTAGTAT CTGATAAAGG TTTCTTCAGA CTTTTTCTCG ACTGGATCTC AGGCATTATT CTTGTCTATT CGATACTCTT CGGCACGGGA AAAATTATTT TCGGAGAGCC GATGGAAGCC ATAATGTACT ACGGAGCTGC CCTGCTTGCC GGCATATTCA TCTATACTGA CCTGAGTCGC AGGGGGTGGA ACCAACTGAG CTGA
|
Protein sequence | MEQLTSLDFS IIAGYLVLTL LIGLFFSKRA SQNVGEFFLS GRKLPWWIAG TGMVATTFAA DTPLAVTGLV AKHGIAGNWL WWTFVSGGML TVFFFARLWR RANILTDLEF IELRYSGKPA QFLRGFKALY FGLFINAVII GWVNLAMYKI IRIMLPELNP EISIVACVML TTLYSGLSGL WGVTITDMVQ FVISMTGCII LAILALQAPE ITQAGGITNA LPEWMFSFFP SITANPSAEG SGGTLELTFA AFAAFAFIQW WASWYPGSEP GGGGYIAQRM MSAKDEKHSL LATLWFIIAH YCLRPWPWII IGLASLVLFP DLPADQKEDG FVYVMQSLLP PGLKGLLIAA FLAAYMSTLS THLNWGTSYL VNDFYKRFIK TEASSAHYVT IAKVFTACVA VFSLFITFFV LETITGAWEF IIQCGAGTGF VLILRWYWWR LNAWSEIVSM IAPFAAYAWL VLYTDITFPG SIYLIVLFTI AATLLVTYAT PATDEKQLQS FYSVTRVGGF FWKKISDQMP DVVSDKGFFR LFLDWISGII LVYSILFGTG KIIFGEPMEA IMYYGAALLA GIFIYTDLSR RGWNQLS
|
| |