Gene Cpha266_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0206 
Symbol 
ID4570587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp230985 
End bp232397 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content50% 
IMG OID639764806 
ProductNa+/solute symporter 
Protein accessionYP_910697 
Protein GI119356053 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCAC TTGACATTGC TATCATCGTA TTCTTTCTTG CAGGAAGCAT CCTGCTCGGT 
CTATGGCAGG GCAAAAGCAA CAAAAACACA GGAGATTACT TCCTCGGAGG CCACAAGTTT
CCATGGATCG TAGCGATGCT CTCTATCGTC GCGACAGAAA CATCGGTACT GACCTTCGTC
AGCGTTCCGG GGCTTGCCTA CCGTGGGGAC TGGACATTCC TGCAGCTTCC GCTAGGCTAC
ATTCTGGGAA GAATACTTGT CAGTATCCTT CTACTGCCGG TTTATTTCAA AAACGGCGTA
ACTTCAATCT ATGAAGTGAT CGGCTCACGA TTTGGCACCG GCATGCAGAA ACTTGCGTCG
ATCGTATTTC TGGTCACCAG AATTCTTGGC GATGGAGTGC GCTTCCTCGC AACGGGAGTA
GTCGTTCAGG TCGTAACCGG TTGGTCTCTC CCTGTCTCGG TCATGATAAT AGGTGTCGTC
ACACTGGTCT ATACCATAAC CGGAGGGCTG AAAACAGTCG TATGGCTCGA CAGTATCCAG
TTCGGTCTTT ATCTCCTCGG TGGAATTATC ACTATTGCCT TTATCCTGCT GCATCTGGAA
AGCAGCCCAC AGGAAACGTT CGCAGCGCTT GCCGACGCAG GAAAACTTGC AGTGCTGAAC
ACCGGAGGAA ACATTCTCTT TAATCCCATG ACCTTTGGCA GCGCCTTTAC CGGCGGAATA
TTCCTTTCGC TTGCATCTCA CGGCATCGAT TACATGATGG TTCAGCGCGT TCTTGGCTGC
AGAGATCTCG GGTCAGCGCG CAAAGCCCTT ATCGGGAGCG GATTTTTCGT TTTTTTTCAG
TTCATGATCT TTCTGCTCGC CGGTTCGCTC ATGTACCTAT TCATGCAGGG CTCGGTAACG
GAAAAAGACC GTGAGTTCGC ATCGTTCATC GTCAACTACC TCCCTTCGGG ATTGAAGGGG
TTACTGCTTG CCGGAATTCT CTCCGCCGCC ATGTCAACCA TAGCATCGTC GATCAATTCT
CTTGCTGCGT CCACCGTTAC CGATCTGCTC GGAGGACGGG TATCGCTCAA TTTCTCGAAA
CTCATCAGCG CTGGATGGGC AGTGGTACTC ATTGGCATCG CGCTTATTTT CGATGAAAGC
GACAAAGCAA TCATCATGGT TGGCCTTGAA ATAGCATCAT TCACCTATGG CGGACTGCTT
GGGCTCTTCC TGCTGTCAAA AACAACACAA AAATATCATC CCGCAAGCCT TGCCATAGGC
CTGCTTGCGA GCATGGGGAT TGTATTCGTG CTCAAGCTCT ATGGCATTGC CTGGACATGG
TATATTCTCG TTTCCGTTTC TGTAAATATC ATGATAACTA TCCTGGTCAA CACTATGCTC
AGGAGTATAA TGTCGAAAGG TGAACAATCG TAA
 
Protein sequence
MQALDIAIIV FFLAGSILLG LWQGKSNKNT GDYFLGGHKF PWIVAMLSIV ATETSVLTFV 
SVPGLAYRGD WTFLQLPLGY ILGRILVSIL LLPVYFKNGV TSIYEVIGSR FGTGMQKLAS
IVFLVTRILG DGVRFLATGV VVQVVTGWSL PVSVMIIGVV TLVYTITGGL KTVVWLDSIQ
FGLYLLGGII TIAFILLHLE SSPQETFAAL ADAGKLAVLN TGGNILFNPM TFGSAFTGGI
FLSLASHGID YMMVQRVLGC RDLGSARKAL IGSGFFVFFQ FMIFLLAGSL MYLFMQGSVT
EKDREFASFI VNYLPSGLKG LLLAGILSAA MSTIASSINS LAASTVTDLL GGRVSLNFSK
LISAGWAVVL IGIALIFDES DKAIIMVGLE IASFTYGGLL GLFLLSKTTQ KYHPASLAIG
LLASMGIVFV LKLYGIAWTW YILVSVSVNI MITILVNTML RSIMSKGEQS