Gene Cag_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0033 
Symbol 
ID3747002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp34929 
End bp36197 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content35% 
IMG OID637772557 
Producthypothetical protein 
Protein accessionYP_378355 
Protein GI78188017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGGCG CTTGGCGAAA TATCACATGG ATATTTATGA ACTCACGTAC AAATATACTC 
ATGCCATTGA CCAATGCTCG TGGATTAGGC AAAGTAACAT TTTTTACTGG CTTGGATTTT
CCTATTGTTA TGAAACCAAG TTTAATGGGG GAACCTGTAT TAACCGCGCA AGCCTTTTAC
TTTCTAGAAA GGCTTGACTC AGTTGTGACT GCAAGTTTTC CAAGCAATAA GGTTCCATTA
CCTCATGAAA TAGACTACAT TATTGATAAT TACCTATTTG AGTATTCAAA AAGGCATCCT
GACAAAAAAA TAACATCAAA AATAACTGAG TTTGTGTTTT GGCAAGAAGA CCCAGACAAT
GCTTATTTTT CATATGACTG GAAACTTACA GAATGTTTGG TATTAGACAC TTTGAACGAC
ACATCCATTA TTGATTGCAA CGACCCCAAA CGTACAATGG GGGAAATCTT TGATTGGTGT
TTATATAAAC CATATTTCGA AGACGCTTTA GAAGAGTACA AAAACAAACT CGAAGAAGCC
GCAAAATATG TGGCGAATGT CAAAACGCAG AATCACTCGA GTTTGGGAAC TGGAGAATAT
CAACTTCCTA TAATTCGCGT TAATTCAAAA CCATTAACAC TTGCGCAAGT TAACATGCTG
GAGGTGGTTT CAGCAGACAG AAAAATTGAT TTAACATCTG ACACGTACGA AGTTAATCGC
GGAATGAAGT CAAGCACTAA TTACTTTTTA CCTCAAGAAG TAATAACCGT AAACAATCGC
CACAACCCAC AGTTATTAGC CTATTACTTT AGTGCTGTGA GAGATTACTC TCCAATTTCC
CAATTCAAAA ACTACTATAA TGTGCTTGAG TATTTTTTTG AAGAAGCCCC GAATCATCTA
GGTATAACTG CAAAAACAGA AGCCGAACAA ATAATTGCGG TATTAAAATT ATTTATAGAC
CCTGTTGAAT TGAATAAAAA ATTCAATGAA ATAGACAAGG CAACACTTGC GCTAATTGAG
AAACCTCAAA TAACCTCTAG TGGTGAAAAT ATAGCAGGTA TAGATTTTTC CGTTACAGAT
ATTCTTGCAG AATATGGACG GCATATTTAC CAGATAAGAA ATGCGTGCAT TCATTCAAAA
AAAACTCGTA AAGGCAAATC TACACCAAGA TTCATCCCAT CATATGATGA GGAAAAGATT
TTAGAATACG AAATGCCCAT ATTGCAATGG ATTGCGATTC AATGCATTGA AAAAGAAAGT
ATTATTTAA
 
Protein sequence
MLGAWRNITW IFMNSRTNIL MPLTNARGLG KVTFFTGLDF PIVMKPSLMG EPVLTAQAFY 
FLERLDSVVT ASFPSNKVPL PHEIDYIIDN YLFEYSKRHP DKKITSKITE FVFWQEDPDN
AYFSYDWKLT ECLVLDTLND TSIIDCNDPK RTMGEIFDWC LYKPYFEDAL EEYKNKLEEA
AKYVANVKTQ NHSSLGTGEY QLPIIRVNSK PLTLAQVNML EVVSADRKID LTSDTYEVNR
GMKSSTNYFL PQEVITVNNR HNPQLLAYYF SAVRDYSPIS QFKNYYNVLE YFFEEAPNHL
GITAKTEAEQ IIAVLKLFID PVELNKKFNE IDKATLALIE KPQITSSGEN IAGIDFSVTD
ILAEYGRHIY QIRNACIHSK KTRKGKSTPR FIPSYDEEKI LEYEMPILQW IAIQCIEKES
II