Gene Information |
Locus tag | Cpha266_0923 |
Symbol | |
ID | 4570614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1051458 |
End bp | 1052435 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765519 |
Product | nucleotidyl transferase |
Protein accession | YP_911395 |
Protein GI | 119356751 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form |
|
|
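The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1051458, 1052435)
print(length, protein_length_aa(length))
```

With the values in this record, the coordinates give 978 bp and the derived protein length is 325 aa, matching the Gene Length and Protein Length rows.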
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000758587 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCA TCATTCCTGT TGCAGGCGTT GGAACCCGTC TGCGTCCTCA CACCTACTCG CACCCGAAAG TTCTTTTAAA TGTTGCCGGC AAACCGATCA TCGGCCACAT CATGGACAAG CTTATTGATG CCGGAATCGA TGAAGCTATT GTCATTGTAG GTTATCTCGG CAGTATGGTC GAAGATTGGC TGCGCAAACA CTACACCATA AAGTTCACCT TTGTCGATCA GACTGAAATG CTTGGTCTTG CCCATGCGGT CTGGATGTGT AAAGACCATG TCGATAAAAC CGATCCCCTG CTTATCATTC TTGGCGACAC GGTTTTCGAT GTCGATCTTT CCCCCGTGCT TCAGAGCCCC TGCTCAACGC TGGGAGTCAA GGAGGTCGAA GACCCTCGAC GGTTCGGAGT GGCGGTGATG GAAGAAAACC GCATTAAAAA ACTTGTTGAA AAACCTGATA CGCCTGTAAG CAATCTGGCC ATTGTCGGCC TCTATTTTCT TTACAAGGCG CAGCCGCTTT TTGAGTGCAT CGATCACCTG ATCAGCAATG AGATAAAAAC CAAAGGAGAG TACCAGCTTA CCGATGCGCT CCAACTCATG ATTGAACGGG GTGAACCGTT TACAACATTT CCTGTTGAAG GGTGGTATGA TTGCGGCAAA CCTGAAACGC TTCTCTCTAC CAACGAAATT CTCCTGCAGA AAACAGTGTC AGGAAAAACG TTTCCAGGAT GCATCATCAA CGAACCCGTA TTCATAGCCG ACAGCGCCAC GCTTGAAAAT GCCATTATCG GACCAAACAC ATCCGTCGCT GAACATGCCG TCATAACTGA TGCCGTCGTA AAAAACTCCA TTATCGGCAG TGAGGCCCAG GTAACCGGTG TAATGCTCAC CCAATCGATT GTAGGCAACA ACGCATCCAT TAACGGGTCC TTCCATAAAA TCAATATCGG CGATTACTCG GAAATAATGA TTGGATAA
|
Protein sequence | MKAIIPVAGV GTRLRPHTYS HPKVLLNVAG KPIIGHIMDK LIDAGIDEAI VIVGYLGSMV EDWLRKHYTI KFTFVDQTEM LGLAHAVWMC KDHVDKTDPL LIILGDTVFD VDLSPVLQSP CSTLGVKEVE DPRRFGVAVM EENRIKKLVE KPDTPVSNLA IVGLYFLYKA QPLFECIDHL ISNEIKTKGE YQLTDALQLM IERGEPFTTF PVEGWYDCGK PETLLSTNEI LLQKTVSGKT FPGCIINEPV FIADSATLEN AIIGPNTSVA EHAVITDAVV KNSIIGSEAQ VTGVMLTQSI VGNNASINGS FHKINIGDYS EIMIG
|
| |
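The gene sequence above is shown in space-separated blocks of 10 bases, so it has to be cleaned before any analysis. The following is a minimal sketch of how the GC content and the protein sequence could be recomputed from it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code; it is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, and this gene begins with ATG.

```python
from itertools import product

# Standard genetic code in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence in this record.
print(translate("ATGAAAGCCA TCATTCCT"))   # start of the CDS
print(translate("GAAATAATGA TTGGATAA"))   # end of the CDS, including the stop codon
```

Pasting the full Gene sequence value into `translate` and `gc_fraction` allows the result to be compared with the Protein sequence and GC content rows of the record.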