Gene Cpha266_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0923 
Symbol 
ID4570614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1051458 
End bp1052435 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content48% 
IMG OID639765519 
Productnucleotidyl transferase 
Protein accessionYP_911395 
Protein GI119356751 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000758587 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCA TCATTCCTGT TGCAGGCGTT GGAACCCGTC TGCGTCCTCA CACCTACTCG 
CACCCGAAAG TTCTTTTAAA TGTTGCCGGC AAACCGATCA TCGGCCACAT CATGGACAAG
CTTATTGATG CCGGAATCGA TGAAGCTATT GTCATTGTAG GTTATCTCGG CAGTATGGTC
GAAGATTGGC TGCGCAAACA CTACACCATA AAGTTCACCT TTGTCGATCA GACTGAAATG
CTTGGTCTTG CCCATGCGGT CTGGATGTGT AAAGACCATG TCGATAAAAC CGATCCCCTG
CTTATCATTC TTGGCGACAC GGTTTTCGAT GTCGATCTTT CCCCCGTGCT TCAGAGCCCC
TGCTCAACGC TGGGAGTCAA GGAGGTCGAA GACCCTCGAC GGTTCGGAGT GGCGGTGATG
GAAGAAAACC GCATTAAAAA ACTTGTTGAA AAACCTGATA CGCCTGTAAG CAATCTGGCC
ATTGTCGGCC TCTATTTTCT TTACAAGGCG CAGCCGCTTT TTGAGTGCAT CGATCACCTG
ATCAGCAATG AGATAAAAAC CAAAGGAGAG TACCAGCTTA CCGATGCGCT CCAACTCATG
ATTGAACGGG GTGAACCGTT TACAACATTT CCTGTTGAAG GGTGGTATGA TTGCGGCAAA
CCTGAAACGC TTCTCTCTAC CAACGAAATT CTCCTGCAGA AAACAGTGTC AGGAAAAACG
TTTCCAGGAT GCATCATCAA CGAACCCGTA TTCATAGCCG ACAGCGCCAC GCTTGAAAAT
GCCATTATCG GACCAAACAC ATCCGTCGCT GAACATGCCG TCATAACTGA TGCCGTCGTA
AAAAACTCCA TTATCGGCAG TGAGGCCCAG GTAACCGGTG TAATGCTCAC CCAATCGATT
GTAGGCAACA ACGCATCCAT TAACGGGTCC TTCCATAAAA TCAATATCGG CGATTACTCG
GAAATAATGA TTGGATAA
 
Protein sequence
MKAIIPVAGV GTRLRPHTYS HPKVLLNVAG KPIIGHIMDK LIDAGIDEAI VIVGYLGSMV 
EDWLRKHYTI KFTFVDQTEM LGLAHAVWMC KDHVDKTDPL LIILGDTVFD VDLSPVLQSP
CSTLGVKEVE DPRRFGVAVM EENRIKKLVE KPDTPVSNLA IVGLYFLYKA QPLFECIDHL
ISNEIKTKGE YQLTDALQLM IERGEPFTTF PVEGWYDCGK PETLLSTNEI LLQKTVSGKT
FPGCIINEPV FIADSATLEN AIIGPNTSVA EHAVITDAVV KNSIIGSEAQ VTGVMLTQSI
VGNNASINGS FHKINIGDYS EIMIG