Gene Cag_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1800 
Symbol 
ID3746915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2320903 
End bp2322540 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content47% 
IMG OID637774338 
Productelectron transfer flavoprotein-ubiquinone oxidoreductase 
Protein accessionYP_380094 
Protein GI78189756 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins)
[COG2440] Ferredoxin-like protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTTG AACCCGAATC CGTTGACTTC GATATTCTTT TTATTGGTGC AGGTCCAGCG 
AATGTAACTG CCGCCATCCA CCTTCAGCGC CTTCTCAACC GCTATAATGA GACTGCCACA
ACAGCGCTTG AACCCTCAAT TGCCATTATT GAAAAAGGGC GTTACGCGAG TGCTCATTTG
CTTTCAGGAG CATTGCTTGA TCCTAAAGCA CTTGAGGAGT TTTTCCCCGA TTATCGCAAC
CAAGGATGCC CCATTGAAGC AACAGTATCC AAAGAGAGCA TCTGGTATGT AAGTGCAAAA
CGGAAGTTCC CCCTGCCATT TATTCCAGAG CAATTCAGTA ATAAAAAATC ACTTATGGTA
TCGCTCAGTA GGCTTGGCGC GTGGTTAGCC GAGCAAGCTG AGGCTGAGGG CATCCAATTG
TTTGATAGCA CTGCGGCTGT TGCTCCATGT ATAGAAAGTG GACGTTTAAC GGGCGTTTAT
ACCGATGACA AAGGGGTGGA TAGTAACGGT GCTCCCAAAG CAAACTATGA GCAAGGATTG
TTGCTTAAAA GTAAGGTTAC CATTGTTGGC GAAGGTGCAG CAGGTTCACT TACGAGGCAG
CTTGCCAAGC ACTTTCCTGC GCTACACGGC AAAACATTGC AACGCTACGA AACAGGTGTT
AAAGAAACAT GGAGCATACC TGAAGGGCGG CTTCAAGCAG GTGAAGTGTA TCATACTTTT
GGCTATCCGC TTGAGCAAGA GCACTATGGT GGTGGATGGG TCTATGCTTT TTCGCCTACT
CTTATTTCGT TGGGATTTGT TTCTTCCCCA AACCAAAGTA ACCCAACCTG CGATCCACAT
GAAAACCTCC AACGTTATAA GCTCCACCCG CTTATTCAGC CAATTCTTGC TGGTGGCAAG
CTGTTAGAAT GTGGGGCGCG AACCATTACT TCGGGCGGTT TAGATGCTAT GCCGCAAATA
TATGGTGATG GATTTTTGCT CACGGGAGAG TCGGCAGGTA TGGTGGATAT GCAGCGCCAT
AAGGGTATTC ACTTGGCGAT GAAGTCGGGC ATGATGGCGG CTGAAACGCT CTTTGACTGC
CTTATTGCCG ATGATTTTTC AACTGCGCAA TTGCAGCGCT ACGAAGAGCG TTTTCGTGCA
TCATGGGCAT ATCAAGAGCT GTATGATGCT CGCAATTACC GCAAAGCGTT TGATAATGGA
TTGTATGCAG GGTTGCTTGC GGCTGGTTTA CAAGTAAATA TTCCGGGACT TTCGTTTGCT
ACAAAGGTGG GGAAAAAAAT AAAACGTGAG CTACCTGAAG CCGATGTTTG TGCTGATGGG
GTTCTTACGT TTACAAAAGA GCGAAGCCTC TTCAACGCCA ATATTCAGCA TGAAGAAAAT
CAACCCTGCC ATCTGCATAT CAATCAAGCC GATATAGAAA ATATTTGCCT AAAGCAATGC
ACTTCACGCT ATGGCAATCC CTGCCAATAT TTTTGCCCTG CTGAAGTCTA TGAAATTGTT
ACGGAACCAA AGTTTGCATT AAAGCTTAAT CCCTCCAACT GCTTGCATTG TAAAACCTGC
GATGTTGCCG ATCCATACGG TATTATCACA TGGACGCCAC CCGAAGGTGG TGGAGGTCCT
GGGTATAAGG TGGGGTAA
 
Protein sequence
MNLEPESVDF DILFIGAGPA NVTAAIHLQR LLNRYNETAT TALEPSIAII EKGRYASAHL 
LSGALLDPKA LEEFFPDYRN QGCPIEATVS KESIWYVSAK RKFPLPFIPE QFSNKKSLMV
SLSRLGAWLA EQAEAEGIQL FDSTAAVAPC IESGRLTGVY TDDKGVDSNG APKANYEQGL
LLKSKVTIVG EGAAGSLTRQ LAKHFPALHG KTLQRYETGV KETWSIPEGR LQAGEVYHTF
GYPLEQEHYG GGWVYAFSPT LISLGFVSSP NQSNPTCDPH ENLQRYKLHP LIQPILAGGK
LLECGARTIT SGGLDAMPQI YGDGFLLTGE SAGMVDMQRH KGIHLAMKSG MMAAETLFDC
LIADDFSTAQ LQRYEERFRA SWAYQELYDA RNYRKAFDNG LYAGLLAAGL QVNIPGLSFA
TKVGKKIKRE LPEADVCADG VLTFTKERSL FNANIQHEEN QPCHLHINQA DIENICLKQC
TSRYGNPCQY FCPAEVYEIV TEPKFALKLN PSNCLHCKTC DVADPYGIIT WTPPEGGGGP
GYKVG