Gene Ava_4054 details


Gene Information

Locus tag: Ava_4054
Symbol:
ID: 3681675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5039282
End bp: 5040256
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 46%
IMG OID: 637719405
Product: nitrogenase iron protein subunit NifH
Protein accession: YP_324553
Protein GI: 75910257
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1348] Nitrogenase subunit NifH (ATPase)
TIGRFAM ID: [TIGR01287] nitrogenase iron protein
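
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon (neither is stated on this page); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5039282
end_bp = 5040256
reported_gene_length = 975      # bp
reported_protein_length = 324   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported gene and protein lengths.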


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00458685
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.287563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACG AAAACATTAG ACAGATAGCT TTTTACGGTA AAGGCGGTAT CGGTAAATCC 
ACCACTTCCC AAAATACCCT CGCAGCAATG GCAGAAGTGG GACAACGTAT TCTGATTGTC
GGATGTGACC CCAAAGCAGA CTCTACCCGC TTGATTCTCC ACACCAAAGC ACAAACTACC
GTACTTCACT TAGCTGCTGA ACGCGGTGCA GTGGAAGACT TAGAACTTGA TGAAGTAGTA
CTCAAAGGCT TCCGCGATAT CAAATGCGTA GAATCTGGTG GTCCAGAACC CGGTGTAGGT
TGCGCTGGTC GTGGTATTAT CACCGCTATT AACTTCCTCG AAGAAAACGG TGCATATCAA
GACGTAGATT TCGTATCTTA CGACGTATTA GGTGACGTTG TATGCGGTGG TTTCGCCATG
CCAATTCGGG AAGGGAAAGC GCAAGAAATC TACATCGTTA CTTCCGGTGA AATGATGGCG
ATGTACGCTG CAAACAACAT CGCTCGCGGT ATTTTGAAAT ATGCTCACTC CGGCGGTGTA
CGCCTGGGTG GTCTAATTTG TAACAGCCGT AAAACTGACC GGGAAGACGA ACTGATTACC
ACACTCGCAA ACCGATTAAG TACCCAGATG ATTCACTTCG TTCCCCGCGA CAACATCGTG
CAGCACGCAG AGTTACGCCG GATGACTGTG AACGAATACG CACCTGATAG CAATCAAGCT
AATGAATACC GCACATTAGC CGACAAGATT ATCAACAATC AAAATATGGC TGTTCCTACA
CCCATCGAAA TGGATGAGTT AGAAGCATTG TTGATTGAGT TCGGTATCCT TGAAAGCGAC
GAAGACAAGG AAAAATTGGT CGGTATGAGC AAAGCTGAAG AAGAAGCTCT CAAGAAGCAA
GAAGAACTCA AAGCTCAAGC ACTGGAAGCT GTGCAGAAAG GCAACGTTGA AGTTGTTTCC
CGTAACAATA AATAG
 
Protein sequence
MSDENIRQIA FYGKGGIGKS TTSQNTLAAM AEVGQRILIV GCDPKADSTR LILHTKAQTT 
VLHLAAERGA VEDLELDEVV LKGFRDIKCV ESGGPEPGVG CAGRGIITAI NFLEENGAYQ
DVDFVSYDVL GDVVCGGFAM PIREGKAQEI YIVTSGEMMA MYAANNIARG ILKYAHSGGV
RLGGLICNSR KTDREDELIT TLANRLSTQM IHFVPRDNIV QHAELRRMTV NEYAPDSNQA
NEYRTLADKI INNQNMAVPT PIEMDELEAL LIEFGILESD EDKEKLVGMS KAEEEALKKQ
EELKAQALEA VQKGNVEVVS RNNK
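
The gene and protein sequences above are related by translation under translation table 11. The following is a minimal sketch in Python, using only the standard library, of how the 10-character display blocks could be joined, translated, and scored for GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, not part of any database tool. The codon table holds the standard amino-acid assignments, which table 11 shares for elongation codons (it differs in its permitted start codons). Only the first and last lines of the gene sequence are embedded, for brevity.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard assignments).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = clean("ATGTCCGACG AAAACATTAG ACAGATAGCT TTTTACGGTA AAGGCGGTAT CGGTAAATCC")
last_line = clean("CGTAACAATA AATAG")

print(translate(first_line))  # first 20 residues of the protein
print(translate(last_line))   # last residues before the stop codon
```

Applied to the full gene sequence, `translate` would be expected to reproduce the 324-residue protein sequence, and `gc_fraction` to land near the reported 46%.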