Gene Paes_1632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1632 
Symbol 
ID6458431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1775706 
End bp1777067 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content49% 
IMG OID642725620 
Productnitrogenase MoFe cofactor biosynthesis protein NifE 
Protein accessionYP_002016297 
Protein GI194334437 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00463409 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA TCGGAATACT CGAAGGAAGA GAAAAACAGG TCTACGAAAA AAAAGGTGAC 
TCAGCTGCGA TTGACATCAA ATGCGAAACC ACCAGCCTTT CCGGATCTGT CAGTCAGAGA
GCCTGCGTTT TCTGCGGCTC GCGTGTTGTC CTCTATCCTG TTGCCGATGC CCTTCATGTA
GTTCATGGTC CTATCGGATG TGCCGCGTAT ACGTGGGACA TCCGGGGGGC CGTGTCTTCC
GGCCCGGAAC TGCACAGGTT GAGCTTCTCG ACCGACCTGA AAGAGATGGA CGTTATCTAT
GGCGGAGAAA AGAAATTACA CCAATCACTC ACTGAACTTA TCGCACAGTA TCAACCGAAA
GCAGCGTTTA TTTACTCGAC CTGCATCATC GGACTTATAG GCGACGATAT TGACGCGGTA
TGTAAAAAAG TTTCACAGGA AACCGGCATT CCTGTTCTTC CCGTTCACTC TGAAGGGTTC
AAAGGCACAA AAAAAGACGG CTATAAAGCC GCCTGTGACT CACTGATGAA GCTCGTCGGC
ACCGGATCGA CAGAAGGTAT CGGAAAATAC AGCATTAACA TTTTAGGAGA ATTCAATCTC
GCAGGCGAAG CCTGGATCAT CAAAAAATAC TACGAAGAAA TGGGTATTGA GGTCGTTGCC
ACAATGACAG GCGACGGCAG GGTTGACGAT ATCCGGCGCT CACACGGAGC ATCGCTCAAT
ATCGTCCAGT GCTCGGGATC TATGGTGAAG CTGGCGAAAA TGATGGAAGA AAAGTACGGC
ATCCCCTACC TGAGGGTCTC CTATTTCGGA ATAGAAGATA TGAGTATGGC GCTCTATGAC
GTCGCCAAAC ATTTCAGCGA CAACCCGGCG ATTCTTGATG CAGCCAAAAA ACTTGTCAAC
CGTGAGGTCA GCGAACTCTA TCCGCGTCTG CAGCACTTCC GTCAAGCGCT GGAAGGCAAA
AAAGCCGCAA TCTATGTCGG TGGAGCATTT AAAGCCTTCT CGCTGATCAA AGCCCTGAAT
TCCGTAGGAA TGAGCGTCGT ACTTGCAGGA TCACAGACCG GCAACAAAGA CGATTATGAG
GGACTCAAAG AGATGTGCGA AGAAGGGACC GTTATCGTCG ATGACTCCAA TCCGGTTGAA
CTCTCCAAAT TCGTACTTGA AAAAGAAGCC GATCTCCTCA TAGGCGGCGT TAAGGAACGG
CCAATCGCAT ATAAACTCGG TATCGGATTC TGCGACCACA ATCATGAACG CAAAATTCCC
CTGGCCGGTT TTGTCGGCAT GTACAACTTT ATCCTGGAGG TTTACAATTC CGTCATGAGC
CCGGTCTGGC AGTTTGCTCC GAGAAAAGGA GGATTATCAT GA
 
Protein sequence
MEKIGILEGR EKQVYEKKGD SAAIDIKCET TSLSGSVSQR ACVFCGSRVV LYPVADALHV 
VHGPIGCAAY TWDIRGAVSS GPELHRLSFS TDLKEMDVIY GGEKKLHQSL TELIAQYQPK
AAFIYSTCII GLIGDDIDAV CKKVSQETGI PVLPVHSEGF KGTKKDGYKA ACDSLMKLVG
TGSTEGIGKY SINILGEFNL AGEAWIIKKY YEEMGIEVVA TMTGDGRVDD IRRSHGASLN
IVQCSGSMVK LAKMMEEKYG IPYLRVSYFG IEDMSMALYD VAKHFSDNPA ILDAAKKLVN
REVSELYPRL QHFRQALEGK KAAIYVGGAF KAFSLIKALN SVGMSVVLAG SQTGNKDDYE
GLKEMCEEGT VIVDDSNPVE LSKFVLEKEA DLLIGGVKER PIAYKLGIGF CDHNHERKIP
LAGFVGMYNF ILEVYNSVMS PVWQFAPRKG GLS