Gene Cag_1250 details


Gene Information

Locus tag: Cag_1250
Symbol:
ID: 3748288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1713589
End bp: 1714947
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 48%
IMG OID: 637773788
Product: nitrogenase iron-molybdenum cofactor biosynthesis protein NifN, putative
Protein accession: YP_379554
Protein GI: 78189216
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
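
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1713589, 1714947)
    # Prints "1359 452", matching the Gene Length and Protein Length fields.
    print(length, protein_length_aa(length))
```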


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.113259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACATG AACATGCAAA ATCCGTTACG CAAAACGCTT GTAAGCTTTG CAACCCACTT 
GGTGCCTGCC TTGCATTCCG TGGCATTGAG CAGTGTGTGC CATTTTTACA CGGTTCGCAA
GGGTGCGCCA CCTACATTCG TCGCTATTTA ATTAGCCACT ATAAAGAGCC AATTGATATT
GCTTCATCAA ACTTTAACGA AGAAACAGCG GTCTTTGGTG GCAGCCACAA CTTAAAGGTG
GGCTTAAAAA ACGTAAGCCA GCAATACAAG CCGCAAGTAA TTGGCATTGC TACAACCTGC
TTAAGTGAAA CCATTGGCGA CGATGTACCA CGCATTTTAC GTGAGTACCA AAAAGAGTTT
AAAAACGGCA CACCAATGCC GCTTTTGATT CACGCATCAA CGCCAAGCTA CCAAGGGAGC
CACATTGATG GATTTCATGC AGCCGTTCAT GCAGCCATTA AAACGCTTGC AACCAAAGGG
CAAAAGCAAG AGCAGATCAA CCTCTTTCCC AACATGGTCT CGCCCGCTGA TTTGCGCCAC
CTGAAAGAGA TTTTTGCGGA CTTTGAGATT CCGCTCATGA TGTTGCCCGA CTATTCGCAA
ACTATGGATG GCGGACCGTG GGCAGAGTAC CACCGCATTC CACCGGGAGG CACGCCAGCA
ACGGCTATTG CTGATTCTGC AAATAGCCGT GCAAGCATTG AATTTGGCTC CACTATTGAA
GCAAACAAAT CAGCAGCACA CTATCTTGAT GTCATGTTTG GTATTCCAGC GTATCGCATG
GCGCTCCCAA TTGGCATTAA AGCAAGCGAT CGCTTTTTCA GCCTGCTTGA AACCTTGAGC
GAAAAAGGGC GCCCTGAAAA GTATGACGAT GAACGTCGCC GCTTAGTAGA TGCCTATGCT
GACGGACACA AATATGTTTT TGAAAAAAAG GTGATTCTCT ACGGCGAAGA AGACCTTGTA
GTTGCCATAA CCGCCTTTTT ACGCGAAATA GGCATGATTC CCGTGCTTTG CGCCTCAGGC
GGAAAGAGCG GCATGTTAAA GGAGCGCATT GCAGAAATTG TGCCCGATAT GGAAGAGCTT
GGCATTAAAG TGCGCGATGG CGTTGACTTT GTTGATATCG AAGATGAAGC TAAAGTGCTA
CACCCCGATT TACTCATGGG CAACAGCAAA GGCTTTACCA TGTCGCGTAA AAATGAGATT
CCGCTCTTAC GCCTTGGCTT CCCAATCCAC GACCGCTTTG GCGGGCAGCG TATGCACCAC
CTTGGCTACC GCGGCACCCT TGAATTGTTC GACCGCATTG TCAACATGAT TATTGAAACA
CGTCAGAACG CATCACCAAT TGGCTACACT TATATGTAA
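
The gene sequence is printed in blocks of 10 bases, 60 per line. A minimal Python sketch for joining the blocks and computing the GC fraction is given here; the helper names are hypothetical, and the result for the full sequence can be compared with the GC content reported in the Gene Information section.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already joined sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence; paste the full block to check the record.
    first_line = (
        "ATGAAACATG AACATGCAAA ATCCGTTACG "
        "CAAAACGCTT GTAAGCTTTG CAACCCACTT"
    )
    seq = join_blocks(first_line)
    print(len(seq), round(100 * gc_fraction(seq)))
```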
 
Protein sequence
MKHEHAKSVT QNACKLCNPL GACLAFRGIE QCVPFLHGSQ GCATYIRRYL ISHYKEPIDI 
ASSNFNEETA VFGGSHNLKV GLKNVSQQYK PQVIGIATTC LSETIGDDVP RILREYQKEF
KNGTPMPLLI HASTPSYQGS HIDGFHAAVH AAIKTLATKG QKQEQINLFP NMVSPADLRH
LKEIFADFEI PLMMLPDYSQ TMDGGPWAEY HRIPPGGTPA TAIADSANSR ASIEFGSTIE
ANKSAAHYLD VMFGIPAYRM ALPIGIKASD RFFSLLETLS EKGRPEKYDD ERRRLVDAYA
DGHKYVFEKK VILYGEEDLV VAITAFLREI GMIPVLCASG GKSGMLKERI AEIVPDMEEL
GIKVRDGVDF VDIEDEAKVL HPDLLMGNSK GFTMSRKNEI PLLRLGFPIH DRFGGQRMHH
LGYRGTLELF DRIVNMIIET RQNASPIGYT YM
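
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is a minimal illustration in Python, not the database's own method. It builds the codon table from the usual compact encoding (bases in T, C, A, G order); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it allows, which this sketch does not model. That simplification is harmless here because the gene starts with ATG. The function name `translate` is illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon to its amino acid; '*' marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence; prints MKHEHAKSVT,
    # the first 10 residues of the protein sequence.
    print(translate("ATGAAACATG AACATGCAAA ATCCGTTACG"))
```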