Gene Cpha266_0024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0024 
Symbol 
ID4568912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp20845 
End bp22209 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content48% 
IMG OID639764626 
Producttransposase 
Protein accessionYP_910519 
Protein GI119355875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGT CAAAGAAGGC GGCAACAATG TCGCTGGTTC ATCCCAATGC CGCGGGCATT 
GACATCGGAT CGCAGTTTCA TGACGTTGCC ATTCCACCGG ATCGAGCAGA GGAAACAGTA
AAAAGCTTCA AAAGCTTTAC CGGTGATTTG CACGCTATGG CGAAATGGCT GACGGCATGC
AGAATTGATA CTATAGCAAT GGAGTCGACT GGCGTGTACT GGATTCCGGC ATTCGAAATT
CTGGAAAACT ACGGGTTCAA GGTGTTCTTG GTCAATGCCC GTGAAGCTAA AAATGTTCCC
GGCAGGAAGA CTGATAGCAA CGATGCCCAA TGGCTTCAGA AATTGCATCA GCTCGGCTTG
TTGCGCGCAA GTTTTCAGCC GACTTCAGTG ATCGCTGAGT TGCGAGCCTA TCTACGCCAA
CGCGAAAAGT TGCTTGACTA CAAAGCGGCC CACATACAGC ACATGCAGAA AGCCCTGATG
CAGATGAACA TCCAATTGCA TCATGTCGTC TCGACAATTA CGGGTAAAAC CGGTATGGAT
ATTATACGTG CAATTGTTGC GGGAAATCGC AACCCACAAG AACTGGTCAA ATTCAGGGAT
GTCAGGTGCA AGAACTCAAT TGAGACCATG ACAGCTGCCC TGACCGGAAA CTTCAAGCCT
GAGCATATAT TTGCCCTCAT GCAGTCACTG GAACTCTACG ACATCTACAA CGAAAAAGCA
GAGGCCTGCG ATCGTGAAAT TCAGGCTGTT CTTGACCGAT TACAGCAAAA CAGCATACCG
CCGGATCAGC CGCTACCAAA AGCAAAATAC AGGGAATGCA ACAAAAATGC ACCTGCTTTT
GATGTTCGTC AAACACTGTT CAATATTATT GGCGTTGATC TGACGCAAAT CACCGGACTG
GGTTCCTATC TGGCATTGAA GCTCGTTTCC GAATGTGGAG CCGACATGTC GAAATGGCCC
ACCGACAAAC ACTTCACATC ATGGCTTTGC TTGTCTCCTG GTAACAAAAT TTCAGGAGGC
AAAATCCTGT CATCAAGAAC GCGTCCAAGC TCAAGTCGAG CAGCTGCCCT ATTGAGGCTT
GCTGCTACTG CCATAGGCCG AACTGAAACC GCCTTAGGCG CTTTTTATCG AAGACTTGCA
ACAAGAACTG GAAAAGCCAA GGCTGTCACT GCCACAGCTC GAAAGATTGC CGTCCTGTTC
TATAACACGC TTCGATACGG AATGCGCTAT GTTGACCCCG GAGCTGATTA TTATGAAGAA
CAATACAAGG CAAGGATTCT GGGTCAGTTA CGTCGCCGTG CTGACTCCTA TGGGTTTTCT
CTCCAACCCA TGGAAATCCC TGATACGGCT ATAGGAGTTT CTTAG
 
Protein sequence
MRKSKKAATM SLVHPNAAGI DIGSQFHDVA IPPDRAEETV KSFKSFTGDL HAMAKWLTAC 
RIDTIAMEST GVYWIPAFEI LENYGFKVFL VNAREAKNVP GRKTDSNDAQ WLQKLHQLGL
LRASFQPTSV IAELRAYLRQ REKLLDYKAA HIQHMQKALM QMNIQLHHVV STITGKTGMD
IIRAIVAGNR NPQELVKFRD VRCKNSIETM TAALTGNFKP EHIFALMQSL ELYDIYNEKA
EACDREIQAV LDRLQQNSIP PDQPLPKAKY RECNKNAPAF DVRQTLFNII GVDLTQITGL
GSYLALKLVS ECGADMSKWP TDKHFTSWLC LSPGNKISGG KILSSRTRPS SSRAAALLRL
AATAIGRTET ALGAFYRRLA TRTGKAKAVT ATARKIAVLF YNTLRYGMRY VDPGADYYEE
QYKARILGQL RRRADSYGFS LQPMEIPDTA IGVS