Gene Cag_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1249 
Symbol 
ID3748287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1712231 
End bp1713592 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content46% 
IMG OID637773787 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_379553 
Protein GI78189215 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.265854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGTA TTGGAATTCT TGAAGCACGC CAAAAGCAGG TTATTGAAAA AAAAGCGGGC 
GAGTCCCAGC CCGAAATTGC TTGCGATACC ACCAGCCTTT CTGGCTCAGT AAGTCAACGC
GCTTGCGTTT TTTGCGGTTC TCGCGTCGTG CTTTATCCCG TAGCTGATGC AATCCATTTA
GTACACGGTC CCATTGGGTG CGCTGCTTAC ACATGGGATA TTCGTGGAGC CGTCTCCTCA
GGTCCCGAGT TACACCGTTT AAGCTTTTCC ACCGACTTGC ACGAAATGGA GGTGATTTAC
GGTGGCGAAA AAAAGCTGTA TAGTTCCTTG AAAGAGCTGA TTGCCCAATA CCAGCCAAAA
GCCGCCTTTA TTTACTCAAC CTGCATTGTT GGCTTAATTG GCGACGACAT TGACGCTGTC
TGCAAAAAGG TTGAAAAAGA GACGGGCATT CCCGTTTTAC CCGTTCACTC CGAAGGCTTT
AAAGGCACCA AAAAAGATGG CTACAAAGCT GCTTGCTTCT CGTTAATGAA GCTGATTGGC
ACAGGTTCAA CCGAAGGCAT TAGCAAATAC AGCATCAATA TTTTGGGTGA ATTTAACCTT
GCGGGTGAAG CATGGATGAT TCGCCAATAT TATGAAAACA TGGGCGTTGA AGTGGTTGCC
ACCTTAACGG GCGATGGTCG CATTGATGCT ATTCGCCGTG CGCATGGCGC TTCGCTCAAC
GTGGTACAAT GCTCAGGCTC TATGACGTGG TTGGCGAAGG AGATGGAAGC CAAGTATGGC
ATTCCCTTTA TTCGCGTTTC CTACTTCGGC ATTGAGGATA TGTCGAAATC ACTCTACGAT
GTAGCACGCC ATTTTGAAGA TCGCCCTGAA ATAATGGAGG CAACCAAAAA GATTGTAAGC
GATGAGGTAA CCAAACTCTA CCCTTCACTG CAAAAATTTA AGAAAGCCTT ACAGGGTAAA
AAAGCCGCTA TTTATGTTGG TGGAGCCTTT AAAACATTTT CGTTGATAAA AGCTCTTCGC
TCAATTGGTA TGTCGGTCGT GTTAGCAGGA TCGCAAACCG GCAATAAGGA CGACTATAAT
CGCCTAAAAG AGATGTGTGA TGAAGGCACC ATTATTGTTG ACGACTCAAA CCCTGTTGAG
CTTTCAAAAT TCATTCTTGA AAAAGAGGCT GATTTATTGA TTGGTGGGGT AAAAGAGCGC
CCCATCGCCT TTAAGCTTGG CGTAGCTTTT TGCGACCACA ACCACGAGCG CAAAATTCCA
CTCGCAGGCT TTGAAGGTAT GTACAATTTT GCATTGGAGG TCTATCAATC GGTTATGAGT
CCCGTGTGGC AGTTTGCTCC ACGAAAAGGA GGTGCGCTAT GA
 
Protein sequence
MDSIGILEAR QKQVIEKKAG ESQPEIACDT TSLSGSVSQR ACVFCGSRVV LYPVADAIHL 
VHGPIGCAAY TWDIRGAVSS GPELHRLSFS TDLHEMEVIY GGEKKLYSSL KELIAQYQPK
AAFIYSTCIV GLIGDDIDAV CKKVEKETGI PVLPVHSEGF KGTKKDGYKA ACFSLMKLIG
TGSTEGISKY SINILGEFNL AGEAWMIRQY YENMGVEVVA TLTGDGRIDA IRRAHGASLN
VVQCSGSMTW LAKEMEAKYG IPFIRVSYFG IEDMSKSLYD VARHFEDRPE IMEATKKIVS
DEVTKLYPSL QKFKKALQGK KAAIYVGGAF KTFSLIKALR SIGMSVVLAG SQTGNKDDYN
RLKEMCDEGT IIVDDSNPVE LSKFILEKEA DLLIGGVKER PIAFKLGVAF CDHNHERKIP
LAGFEGMYNF ALEVYQSVMS PVWQFAPRKG GAL