Gene Cphamn1_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1120 
Symbol 
ID6374795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1204916 
End bp1206679 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content52% 
IMG OID642683622 
ProductNa+/solute symporter 
Protein accessionYP_001959539 
Protein GI189500069 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.460734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAGC TTACCTCTCT TGATTTCAGC ATCATCGCCG GCTATCTGGT GCTGACCCTG 
CTTATCGGGC TCTTTTTTTC AAAAAGGGCT TCTCAGAACG TCGGCGAGTT CTTTCTTTCG
GGACGTAAGC TGCCCTGGTG GATTGCCGGA ACCGGTATGG TCGCCACTAC TTTCGCCGCC
GATACGCCGC TTGCCGTTAC AGGCCTTGTG GCCAAACACG GCATTGCGGG AAACTGGCTC
TGGTGGACGT TTGTATCAGG AGGAATGCTC ACCGTGTTTT TCTTTGCACG ACTATGGCGA
AGGGCAAACA TTCTCACCGA CCTTGAATTT ATCGAGCTGC GTTACAGCGG CAAACCCGCA
CAGTTTCTCC GGGGCTTCAA GGCTCTCTAC TTCGGGCTCT TTATCAATGC GGTGATCATC
GGCTGGGTGA ACCTGGCCAT GTACAAGATC ATCAGAATCA TGCTTCCGGA ACTGAATCCT
GAAATATCCA TTGTTGCCTG CGTTATGCTG ACGACCCTTT ATTCAGGTCT TTCAGGCCTC
TGGGGCGTAA CCATAACAGA CATGGTACAA TTCGTCATAT CCATGACGGG ATGTATTATT
CTGGCCATCC TGGCGCTTCA GGCTCCGGAA ATCACACAGG CGGGGGGCAT AACAAACGCC
CTGCCGGAAT GGATGTTCAG CTTTTTCCCC TCGATCACCG CAAACCCGTC CGCCGAAGGA
TCAGGAGGCA CTCTTGAACT TACCTTTGCC GCGTTTGCCG CATTCGCTTT CATACAGTGG
TGGGCTTCCT GGTATCCCGG CTCGGAACCG GGGGGCGGGG GCTACATCGC TCAACGCATG
ATGAGCGCCA AAGACGAAAA GCATTCTCTG CTTGCCACGC TATGGTTCAT TATCGCGCAC
TACTGTCTGA GACCCTGGCC ATGGATCATT ATTGGACTCG CGAGTCTGGT GCTTTTCCCT
GACCTGCCTG CCGACCAGAA AGAAGACGGT TTTGTCTATG TCATGCAGTC CCTTCTTCCG
CCGGGCCTCA AAGGGCTGCT GATAGCCGCT TTTCTGGCCG CGTATATGTC CACCCTCTCA
ACGCACCTCA ACTGGGGGAC AAGCTATCTG GTCAATGATT TCTATAAACG CTTTATCAAA
ACGGAAGCCT CTTCGGCTCA CTACGTTACC ATTGCAAAGG TGTTCACCGC ATGCGTTGCG
GTTTTTTCTC TATTCATAAC CTTCTTTGTG CTGGAAACCA TCACCGGCGC CTGGGAATTC
ATTATCCAGT GCGGAGCCGG CACAGGATTC GTACTCATCC TCCGCTGGTA CTGGTGGAGA
CTCAACGCAT GGTCGGAGAT CGTTTCCATG ATCGCGCCGT TTGCCGCATA CGCCTGGCTT
GTTCTGTATA CCGACATCAC TTTCCCGGGC TCTATCTACC TTATCGTTCT CTTTACCATA
GCAGCAACGC TGCTTGTCAC CTATGCAACC CCGGCTACGG ACGAGAAACA GCTTCAGAGC
TTCTACTCGG TCACCAGGGT CGGAGGATTT TTCTGGAAAA AAATATCCGA CCAGATGCCG
GACGTAGTAT CTGATAAAGG TTTCTTCAGA CTTTTTCTCG ACTGGATCTC AGGCATTATT
CTTGTCTATT CGATACTCTT CGGCACGGGA AAAATTATTT TCGGAGAGCC GATGGAAGCC
ATAATGTACT ACGGAGCTGC CCTGCTTGCC GGCATATTCA TCTATACTGA CCTGAGTCGC
AGGGGGTGGA ACCAACTGAG CTGA
 
Protein sequence
MEQLTSLDFS IIAGYLVLTL LIGLFFSKRA SQNVGEFFLS GRKLPWWIAG TGMVATTFAA 
DTPLAVTGLV AKHGIAGNWL WWTFVSGGML TVFFFARLWR RANILTDLEF IELRYSGKPA
QFLRGFKALY FGLFINAVII GWVNLAMYKI IRIMLPELNP EISIVACVML TTLYSGLSGL
WGVTITDMVQ FVISMTGCII LAILALQAPE ITQAGGITNA LPEWMFSFFP SITANPSAEG
SGGTLELTFA AFAAFAFIQW WASWYPGSEP GGGGYIAQRM MSAKDEKHSL LATLWFIIAH
YCLRPWPWII IGLASLVLFP DLPADQKEDG FVYVMQSLLP PGLKGLLIAA FLAAYMSTLS
THLNWGTSYL VNDFYKRFIK TEASSAHYVT IAKVFTACVA VFSLFITFFV LETITGAWEF
IIQCGAGTGF VLILRWYWWR LNAWSEIVSM IAPFAAYAWL VLYTDITFPG SIYLIVLFTI
AATLLVTYAT PATDEKQLQS FYSVTRVGGF FWKKISDQMP DVVSDKGFFR LFLDWISGII
LVYSILFGTG KIIFGEPMEA IMYYGAALLA GIFIYTDLSR RGWNQLS