Gene Ppha_1955 details

Gene Information

Locus tag: Ppha_1955
Symbol:
ID: 6463074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 2042975
End bp: 2044327
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 51%
IMG OID: 642728158
Product: Nitrogenase
Protein accession: YP_002018788
Protein GI: 194336994
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
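
The length fields above are consistent with the listed coordinates. The Python sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2042975, 2044327)
print(length, protein_length(length))  # prints: 1353 450
```

Both results match the Gene Length and Protein Length fields of the record.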


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0171503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACACG CAAAAACAGC AACACAGAAT GCCTGCAAAC TCTGCAACCC GCTTGGAGCA 
TGTCTTGCCT TCCGGGGCAT AGAGAATTGC GTACCGTTCC TGCACGGTTC ACAGGGGTGT
GCCACCTATA TACGGCGGTA CCTGATCAGC CATTACAAAG AGCCGATCGA TATTGCCTCA
TCGAACTTTC ATGAAGAAAC CGCCGTCTTC GGCGGCAGCC ATAACCTGAA AATCGGACTG
AAAAACGTCT CTGATCAGTA CAAACCAGAG GTTATCGGGG TGGCAACCAC CTGCCTGAGC
GAAACCATCG GCGATGATGT GCCCGGAATT TTGCGGGAAT ATAAAAAGGA GTTCAAAAAC
GGCACACCAA TGCCGATACT GATTCACGCC TCAACGCCGA GCTACCAGGG CAGTCACATT
GACGGTTTTC ATGCCGCAGT CAGGGCGACC GTCAAAACGC TGGCCGAGAA GGGCGAAGAG
CAGAGCCTGA TCAACCTCTT TCCGAACATG GTCTCGCCAG CCGACCTGCG CTATCTCAAG
GAGATCTTCG CTGATTTCAA GGCTCCGGTG ATGTTGCTGC CCGACTATTC CCAGACCATG
GATGGCGGCC CCTGGGGCGA ATATCATCGC ATACCGCCTG GCGGCACTCC GGCAAGCGCT
ATTGTATCGG CAGGAAGCGC CGCAGCAAGC ATTGAGTTCG GTTCAACCCT TGAAGCATCG
AAATCGGCTG CCGGCTATCT TGAAGAGGCG TTTGATGTGC CAAGATATCA TCTCTCCCTG
CCCATCGGCA TCAAGGAGAG CGACAAATTT TTCAGCCTGC TTGAAACACT GACCGGCAAG
GCTCGGCCCG ATAAATATGA CGATGAACGA CGCAGGCTGA TTGACGCCTA TGCCGACGGC
CATAAATATG TTTTCGAGAA AAAGGTAATT CTCTACGGTG AAGAAGACCT TGTGGTTGCC
ATGACCGCAT TTCTCACTGA GATCGGTATG ACGCCCCTCC TCTGTGCTTC GGGAGGAAAA
AGTGGTCTTC TAAAAAAAAG GATCAGGGAG CTGATCCCCA CAATGGATGA ACTCGGTATC
AAGGTACGTG AAGGGGTTGA CTTTGTCGAT ATCGAGGATG AAGCCAAAAT ACTGAAACCC
GATTTTCTTA TCGGTAACAG CAAGGGCTAT ACCATGTCAA GAAAAAACAA CATCCCCTTG
CTTCGGCTCG GTTTCCCCAT TCACGACCGT TTCGGAGGAC AAAGAATGCA CCATCTTGGG
TACAGGGGAA CCCAGGAACT CTTTGACCGG ATCGTCAACA CCGTTATCGA AGAGCGGCAG
AATGCTTCAT CAATCGGTTA CACTTATATG TAA
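
The GC content field can be recomputed from the gene sequence. The sketch below is one possible way to do this in Python, assuming the sequence is pasted exactly as shown, in space-separated blocks of ten bases. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare the result against the GC content field of the record.
first_line = "ATGAAACACG CAAAAACAGC AACACAGAAT GCCTGCAAAC TCTGCAACCC GCTTGGAGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```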
 
Protein sequence
MKHAKTATQN ACKLCNPLGA CLAFRGIENC VPFLHGSQGC ATYIRRYLIS HYKEPIDIAS 
SNFHEETAVF GGSHNLKIGL KNVSDQYKPE VIGVATTCLS ETIGDDVPGI LREYKKEFKN
GTPMPILIHA STPSYQGSHI DGFHAAVRAT VKTLAEKGEE QSLINLFPNM VSPADLRYLK
EIFADFKAPV MLLPDYSQTM DGGPWGEYHR IPPGGTPASA IVSAGSAAAS IEFGSTLEAS
KSAAGYLEEA FDVPRYHLSL PIGIKESDKF FSLLETLTGK ARPDKYDDER RRLIDAYADG
HKYVFEKKVI LYGEEDLVVA MTAFLTEIGM TPLLCASGGK SGLLKKRIRE LIPTMDELGI
KVREGVDFVD IEDEAKILKP DFLIGNSKGY TMSRKNNIPL LRLGFPIHDR FGGQRMHHLG
YRGTQELFDR IVNTVIEERQ NASSIGYTYM
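
The protein sequence corresponds to the gene sequence translated with translation table 11. The Python sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to sense codons as the standard code. Table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids
# ('*' marks a stop codon). Sense-codon assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAAACACG CAAAAACAGC AACACAGAAT"))  # prints: MKHAKTATQN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should yield all 450 residues.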