Gene Cpha266_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1083 
Symbol 
ID4570027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1223760 
End bp1225526 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content48% 
IMG OID639765680 
ProductNa+/solute symporter 
Protein accessionYP_911548 
Protein GI119356904 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0991073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGACAT TAACGCTTCT TGACTACTCG TTTATTGTCG GATACCTTCT CCTGACGCTG 
TTTATCGGCT TGTTGTTTTC AAAAAAAGCC TCTGAAAATG TTGGCGAATT TTTTCTTTCA
GGCAGAAAAC TTCCCTGGTG GATTGCCGGA ACCGGTATGG TTGCAACAAC CTTTGCCGCT
GACACGCCGC TTGCCGTAAC AGGATTGGTA GCAAAAAACG GCATTGCCGG AAACTGGGTC
TGGTGGACGT TTGTCTCCGG AGGAATGCTG ACTGTTTTCT TTTTCGCAAG ACTCTGGCGC
CGATCAAACA TCCTTACCGA CCTTGAATTT ATCGAAATTC GATACAGCGG TACCGCCGCC
AAATTTCTTC GCGGATTCAA GGCGCTCTAT TTCGGACTTT TTATCAATTC GATCATTATC
GGCTGGGTTA ATCTTGCGAT GTACAAGATA ATAAGGATTA TGGTTCCTGA ACTCAACCCC
GAAATCACGA TCATAGCTCT TGTTGTTCTC ACGACCGTCT ACTCAGGACT TTCCGGGTTA
TGGGGCGTTT CTATTACTGA CGCGGTGCAG TTCATTATCG CCATGACCGG CTGCATCATC
CTTGCCGTTC TTGCACTGCA GGCACCTGAA GTTGGAGGCA TCTCCGGTTT ACAGCACGCA
CTTCCAGCCT GGATGTTTGA CTTTTATCCC TCACTTTCCG GCTCCCGAGA AACGCCCGTT
CAGGATAGCG GAGCGTTCTC GCTCCCCTTT GCATCGTTTG CCGCAATGGC ATTTGTCCAA
TGGTGGGCGT CATGGTACCC TGGTTCCGAA CCGGGAGGCG GCGGCTATAT TGCCCAGCGA
ATGATGAGTG CCAAAGATGA AAAGCACTCT CTTCTTGCGA CACTCTGGTT TACCGTTGCT
CACTACTGTC TTCGTCCATG GCCATGGATC ATTGTCGCTC TTGCGAGTCT TGTCATGTTC
CCCGACCTCC CCCTTGATCA GAAAGAGGAC GGATTTGTCT ATGTGATGAA AACCGTTCTG
CCTTCGGGAC TGAAAGGACT GCTTGTGGCC GCATTTCTTG CCGCGTACAT GTCAACCCTT
TCAACCCATC TCAACTGGGG AACAAGCTAC CTGATTAACG ATTTCTATCA ACGGTTTCTG
AAGCCGGAGG CAGAAGCCGC GCATCTGGTA AAGGCGTCAA AAATCGTTAC CGGTCTGATC
GCAATTTTTT CGCTTTTTAT CACGTTCTAT GTACTTAAAA CCATTACAGG GGCATGGGAA
TTCATTATTC AATGTGGTGC CGGCACAGGT TTTGTACTGA TTTTCCGCTG GTTCTGGTGG
CGACTGAACG CGTGGAGTGA AATCACCTCA ATGCTTGCAC CTTTTCTTGC CTATGCATGG
ATTTCCTTTT TCACCTCAAT CACCTTCCCG GACTCATTGT TCATTATCGT CCTGTTTACA
ATATCGTCAA CACTGATTGT AACATTTTTA ACCCCTCCGA CCGATACCGA CCGCTTGCAG
TCATTTTACA GAACCACAAG GGTTGGCGGC ATTCTATGGA AAAAGATTTC CGTGACCATG
CCGGAGGTTG AATCAGACAA GGGATTTATC ATGCTTTTCA TTGATTGGCT GCTCGGCATT
ATCCTTGTTT ACGCCGCACT GTTCGGAACC GGAAAACTCA TCTTTGGAGA TCCAATGCAA
GCCGTTATCT ACTTTGCAAC CGCCCTCGGT GCAGGAACGC TCATCTACAA AGACCTGAAC
CGGCGAGGAT GGAACAATCT GAAATGA
 
Protein sequence
METLTLLDYS FIVGYLLLTL FIGLLFSKKA SENVGEFFLS GRKLPWWIAG TGMVATTFAA 
DTPLAVTGLV AKNGIAGNWV WWTFVSGGML TVFFFARLWR RSNILTDLEF IEIRYSGTAA
KFLRGFKALY FGLFINSIII GWVNLAMYKI IRIMVPELNP EITIIALVVL TTVYSGLSGL
WGVSITDAVQ FIIAMTGCII LAVLALQAPE VGGISGLQHA LPAWMFDFYP SLSGSRETPV
QDSGAFSLPF ASFAAMAFVQ WWASWYPGSE PGGGGYIAQR MMSAKDEKHS LLATLWFTVA
HYCLRPWPWI IVALASLVMF PDLPLDQKED GFVYVMKTVL PSGLKGLLVA AFLAAYMSTL
STHLNWGTSY LINDFYQRFL KPEAEAAHLV KASKIVTGLI AIFSLFITFY VLKTITGAWE
FIIQCGAGTG FVLIFRWFWW RLNAWSEITS MLAPFLAYAW ISFFTSITFP DSLFIIVLFT
ISSTLIVTFL TPPTDTDRLQ SFYRTTRVGG ILWKKISVTM PEVESDKGFI MLFIDWLLGI
ILVYAALFGT GKLIFGDPMQ AVIYFATALG AGTLIYKDLN RRGWNNLK