Gene Ava_3917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3917 
Symbol 
ID3683494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4871065 
End bp4872507 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content45% 
IMG OID637719269 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_324417 
Protein GI75910121 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCTC CAGAAAACAA GAATCTTGTA GATGAAAATA AAGAACTTAT TAAAGAAGTT 
CTGAAAGCTT ATCCCGAAAA ATCTCGCAAA AAACGCGAAA AGCACCTCAA CGTCCACGAA
GAAAACAAGT CTGATTGCGG CGTAAAGTCT AACATCAAAT CCGTTCCTGG TGTAATGACC
GCTCGTGGTT GTGCTTATGC AGGTTCTAAG GGTGTGGTTT GGGGTCCTAT TAAGGACATG
ATTCACATCA GCCACGGGCC TGTAGGTTGC GGTTACTGGT CTTGGTCTGG TCGTCGTAAC
TACTACGTTG GTGTAACTGG TATCAACTCT TTCGGTACCA TGCACTTTAC ATCAGACTTC
CAAGAACGTG ACATCGTGTT CGGTGGTGAC AAAAAACTCG TTAAACTCAT TGAAGAACTT
GACGTTCTGT TCCCTCTAAA CCGTGGTGTT TCCATTCAAT CTGAATGTCC CATCGGTCTA
ATTGGGGATG ACATCGAAGC TGTAGCTAAG AAAACTTCTA AGCAAATTGG TAAGCCTGTT
GTACCCTTAC GTTGCGAAGG TTTCCGTGGT GTATCTCAGT CTTTAGGACA CCACATCGCT
AACGACGCTA TCCGTGACTG GATTTTCCCA GAATACGACA AGCTGAAGAA AGAAAACAGA
CTCGACTTCG AGCCAAGCCC CTATGATGTA GCCCTAATCG GTGACTACAA CATCGGTGGT
GACGCTTGGG CTAGCCGTAT GTTGTTGGAA GAAATGGGCT TACGTGTTGT AGCTCAGTGG
TCTGGTGATG GTACTCTTAA CGAGTTGATC CAAGGCCCTG CTGCTAAGTT AGTCCTCATC
CACTGCTACC GTTCTATGAA CTACATCTGC CGTAGTTTGG AAGAACAATA TGGTATGCCT
TGGATGGAGT TCAACTTCTT CGGCCCCACC AAGATTGCTG CTTCCTTACG TGAAATCGCA
GCTAAGTTTG ATTCCAAGAT TCAAGAAAAC GCTGAGAAGG TAATTGCTAA GTACACACCA
GTAATGAATG CTGTACTTGA TAAGTACCGT CCTCGTTTGG AAGGTAACAC CGTAATGTTG
TACGTAGGTG GTCTACGTCC TCGTCACGTT GTTCCTGCTT TTGAAGATTT GGGTATCAAA
GTTATCGGTA CAGGTTACGA GTTCGCCCAC AACGACGACT ACAAACGTAC CACCCACTAC
ATCGATAACG CCACCATCAT TTATGATGAC GTTACCGCCT ACGAATTTGA AGAGTTCGTA
AAAGCTAAGA AGCCTGATTT AATCGCTTCT GGTATTAAAG AGAAGTATGT CTTCCAAAAG
ATGGCTCTTC CCTTCCGTCA AATGCACTCT TGGGATTACT CCGAACCTAG CGATGGGGTG
CAAATGTCAG ATCAGATAAG GTTTTTTGGT GAGGGGAGAA AAATAAGTCT ATTTTTAGCC
TAA
 
Protein sequence
MTPPENKNLV DENKELIKEV LKAYPEKSRK KREKHLNVHE ENKSDCGVKS NIKSVPGVMT 
ARGCAYAGSK GVVWGPIKDM IHISHGPVGC GYWSWSGRRN YYVGVTGINS FGTMHFTSDF
QERDIVFGGD KKLVKLIEEL DVLFPLNRGV SIQSECPIGL IGDDIEAVAK KTSKQIGKPV
VPLRCEGFRG VSQSLGHHIA NDAIRDWIFP EYDKLKKENR LDFEPSPYDV ALIGDYNIGG
DAWASRMLLE EMGLRVVAQW SGDGTLNELI QGPAAKLVLI HCYRSMNYIC RSLEEQYGMP
WMEFNFFGPT KIAASLREIA AKFDSKIQEN AEKVIAKYTP VMNAVLDKYR PRLEGNTVML
YVGGLRPRHV VPAFEDLGIK VIGTGYEFAH NDDYKRTTHY IDNATIIYDD VTAYEFEEFV
KAKKPDLIAS GIKEKYVFQK MALPFRQMHS WDYSEPSDGV QMSDQIRFFG EGRKISLFLA