Gene Cag_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1248 
Symbol 
ID3748286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1710520 
End bp1711977 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content48% 
IMG OID637773786 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_379552 
Protein GI78189214 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.36292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTGCCAC TATTTTCTTA TGAAAAAGAT GATAACACAG GCGCCCTTGT AAGTACTCAC 
AAACAGGAGA AAACTCAAAT GCTATTACGA CACACACCAA AAGAGGTTAA AGAGCGCGAG
GGGCTAACCA TTAACCCGGC AAAAACATGC CAGCCCATTG GCGCCATGTA CGCTGCTCTT
GGTATTCACG GCTGCCTACC TCACAGCCAC GGTTCACAAG GGTGCTGCTC ATACCACCGC
AGCACGCTTA CTCGCCACTA CAAAGAGCCA GTAATGGCTG CAACAAGCTC GTTTACTGAA
GGCGCTTCGG TATTTGGAGG TCAGGCAAAC TTGCTATCGG CTATCGAAAC CATCTTCTCG
GTTTACGATC CCGATATTAT TGCCGTACAC TCAACCTGCC TCAGCGAAAC CATTGGCGAC
GACTTGCAGC AAATTACCAA AAAGGCAAGC GACGATGGTA AAATTCCAGC AGGCAAATAT
GTGATTTATG CCAGCACGCC AAGCTTTGTA GGCTCACACG TTACCGGCTA TTCCAACATG
GTTGCGGGTA TTGCCGAACA ATTTGCACAA GTGAGCGACA CAAAAACCGA CCAAATCAAC
ATTGTTGCTG GTTGGATGGA GCCTTCCGAC ATGCGCGAAA TTAAGCGTCT GTCAAGCGAG
TTGGGCGTTA AAATTGTCCT CTTCCCCGAC ACTTCGGACG TGCTTGATGC TCCGCAAACC
GGCAAGCATG TTTTCTACCC AAAAGGTGGA ACAACGATTG ACGAACTAAA AAGCATTGGC
TCCAGCAAAT GTTCTTTAGC GCTCGGTTGC ATTAGTGCTG AACCTGCGGC TATTGCCATT
GAGAAAAAGT GCAAAGTGCC ATTTGAAACG GTGGATATGC CAATTGGTGT AAGCGCTACC
GATCGCTTCC TGATGGCGCT CAGCAAAGCG GCACATGTGC AAATTCCTGC ACACATTACG
GCTGATCGCG GACGTTTGGT TGATGTTATG ACCGATATGG AGCAATACTT CTACGGCAAA
AAAGTTGCTC TCTTTGGCGA CCCCGATCAG CTCATTCCGC TCACCGAGTT CTTGCTCGAC
CTTGGCATGA AGCCAACGCA CATTGTAAGC GGTACGCCCG GTATGCGTTT TGAAAAGCGC
ATGAAAGAAA TTCTGAAGCG TGCACCGTAT GCAAACTTTA AAAACGGACT TGGTGCCGAT
ATGTTCTTAC TCCATCAATG GATGAAAAAC GAACCTGTTG ACTTGCTTAT TGGCAACACT
TATGGCAAGT ACATTGCTCG TGATGAAGAT ATCCCATTTG TACGCTACGG CTTCCCCATT
CTCGACCGCA TTGGGCACAG CTACTTCCCA AGCGTTGGCT ACATGGGTGG CTTACGACTT
GTGGAAAAAA TTCTTAGCGC ATTGATGGAT CGCCAAGATA GCGCAGCATT AGAAGAGAAG
TTTGAGTTAG TGATGTAA
 
Protein sequence
MLPLFSYEKD DNTGALVSTH KQEKTQMLLR HTPKEVKERE GLTINPAKTC QPIGAMYAAL 
GIHGCLPHSH GSQGCCSYHR STLTRHYKEP VMAATSSFTE GASVFGGQAN LLSAIETIFS
VYDPDIIAVH STCLSETIGD DLQQITKKAS DDGKIPAGKY VIYASTPSFV GSHVTGYSNM
VAGIAEQFAQ VSDTKTDQIN IVAGWMEPSD MREIKRLSSE LGVKIVLFPD TSDVLDAPQT
GKHVFYPKGG TTIDELKSIG SSKCSLALGC ISAEPAAIAI EKKCKVPFET VDMPIGVSAT
DRFLMALSKA AHVQIPAHIT ADRGRLVDVM TDMEQYFYGK KVALFGDPDQ LIPLTEFLLD
LGMKPTHIVS GTPGMRFEKR MKEILKRAPY ANFKNGLGAD MFLLHQWMKN EPVDLLIGNT
YGKYIARDED IPFVRYGFPI LDRIGHSYFP SVGYMGGLRL VEKILSALMD RQDSAALEEK
FELVM