Gene Paes_1633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1633 
Symbol 
ID6459286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1777064 
End bp1778434 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content51% 
IMG OID642725621 
ProductNitrogenase 
Protein accessionYP_002016298 
Protein GI194334438 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000322358 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAG CAAAAACAGC CACCCAGAAT GCATGCAAGC TCTGCAACCC GCTTGGCGCA 
TGCCTGGCAT TCAGGGGAAT AGAAAAGTGC GTACCGTTTC TTCACGGCTC TCAAGGATGC
GCGACCTATA TCCGCCGCTA CCTTATCAGC CATTTCAAAG AACCGATCGA TATCGCATCA
TCCAACTTTA ATGAAGATAC CGCAGTGTTT GGCGGCAGCC ACAACCTGAA ACTCGGCCTG
AAAAATGTAA CCCAGCAGTA CCGCCCTGAA GTCATCGGCC TTGCAACCAC CTGCCTTTCC
GAAACGATTG GCGATGATGT CGACATGATC CTCAAGGAGT ATGTCAACCT TTTTGAAAAC
GGCGAACCGC TGCCGAACGG CGAACCTTTG CCTGTGATGA TCCATGCGTC AACGCCAAGC
TATCAAGGCA GCCATATCGA CGGTTTTCAC GCTGCGATGA AAGCCACTGT GGCAAAACTT
GCTGAAGAGG GTCCGAAAAA GAATCTGCTG AACATCTTTC CGAATATGGT CTCGCCTGCA
GATCTGCGGC ACATGAAAGA GATTCTGAAG GATTTCAACA TCCCCTTCGT CCTGCTCCCG
GATTACTCGG AAACACTCGA TGGCGGACCA TGGGCCGAGT ACCACAGAAT CCCCAAAGGC
GGAACACCCG TGAGCACCAT CAAAGAAACT GGTATGGCAT CAGGAAGCAT CGAATTCAGC
TCGGTCCTCA ATACCGAAAA ATCCCCGGCA GGCTATCTCG AAAAAACATT CGCGGTTCCC
CGCTACCAGA TGCCAATGCC AATCGGCATC AAGCAAAGTG ATGCGTTCTT CGGGCTGCTC
GAAAAGCTTT CTGAAAAACC GCTGCCGGAA AAATACGAGG ATGAGCGCAG ACGGCTTGTC
GACGCCTATG CCGACGGACA CAAATATATT TTCGAGAAAA AAGCCATCGT CTACGGGGAA
GAAGACCTTG TCGTCGCCAT GGCGGCCTTC CTTCGTGAAA TCGGTATCGT GCCTGTACTC
TGTGCATCGG GCGGCAAAAG CGGGCTGTTA AAAAAACGCC TGCAGGAACT GATCCCCGAT
CTCGATGAAG CCGGCATCAA GGTTCGTGAC GGCGTGGACT TCGTTGATAT CGAGGATGAA
GCAAAAGTGC TGAAACCCGA TCTGCTCATC GGCAACAGCA AGGGCTATAC CATGTCACGC
AAGCACAACA TCCCGTTCAT CCGGATCGGC TTTCCCATAC ACGACAGATT CGGCGGTCAA
CGGCAGCTTC ATCTGGGTTA CCGTGGAACG CAAGAACTGT TTGACAGAAT TGTCAATACC
GTTATTGCCG AAAAACAGAG TTCTTCACCA ATCGGCTATA CATACATGTA A
 
Protein sequence
MKTAKTATQN ACKLCNPLGA CLAFRGIEKC VPFLHGSQGC ATYIRRYLIS HFKEPIDIAS 
SNFNEDTAVF GGSHNLKLGL KNVTQQYRPE VIGLATTCLS ETIGDDVDMI LKEYVNLFEN
GEPLPNGEPL PVMIHASTPS YQGSHIDGFH AAMKATVAKL AEEGPKKNLL NIFPNMVSPA
DLRHMKEILK DFNIPFVLLP DYSETLDGGP WAEYHRIPKG GTPVSTIKET GMASGSIEFS
SVLNTEKSPA GYLEKTFAVP RYQMPMPIGI KQSDAFFGLL EKLSEKPLPE KYEDERRRLV
DAYADGHKYI FEKKAIVYGE EDLVVAMAAF LREIGIVPVL CASGGKSGLL KKRLQELIPD
LDEAGIKVRD GVDFVDIEDE AKVLKPDLLI GNSKGYTMSR KHNIPFIRIG FPIHDRFGGQ
RQLHLGYRGT QELFDRIVNT VIAEKQSSSP IGYTYM