Gene Information |
Locus tag | Avin_04360 |
Symbol | |
ID | 7759395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 414057 |
End bp | 415205 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643803357 |
Product | Soluble hydrogenase, alpha or beta chain |
Protein accession | YP_002797667 |
Protein GI | 226942594 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.157479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCTC TGCCGACCGC GACCGCCGCC GTTCTCGACC GCGACGGCCT GCAGGCCCTG ATCGAAGCCC TCGCCGCCCG CGGCTTCCGG GTGTGCGGAC CGGTCGTGCG CGACGCGGCG ATCGTCTACG ACGAGATTTC CGGGGTGGCC GACCTGCCGG CCGGCTGGAC CGACCGTCAG GCGCCCGGCC GGTATCGCCT GGAGCGGCGT GCCGACCAGG CGCTGTTCGG CTTCGCCGCC GTGCCGCAGG CGTGGAAGCG CTTCCTGCAT CCGCCGGTCG AAACCCTCTG GCGGGTGCGC CAGGGCGAGG ACGGCATGCT GTTCAGCAGC GCCGTCGAGA CGGCGCCCCG GTACGCCTTT CTCGGTGTGC GTGCCTGCGA TCTCAGGGCC ATCGCCATCC AGGACCGGGT GTTCTGCGCG GACGACTGCC GCGATGACGC CTACGAGCGC CGGCGGCGCG ACGCCTTCAT CGTTGCGCTG AACTGCTCCG AGGCCGGCGA CACCTGCTTC TGCGTGTCCA TGGGCACCGG GCCCAGGGCC GAGGGTGCCT TCGATCTGGC GCTGACCGAG CTGCTCGACG CGAGCCGCCA CGAATTCCTG GTCGAGGTCG GCAGCGCGGC GGGCGAGGAA CTGCTGGCCG GCCTGCCTCG GCGAACGGCC ACCGAGCCCG ACCGGGCCGC GGCCGCGTCG GTCGTCGCAC GCACTGCCGG TCGCATGGGC CGCAGCCTGG AAACCGCCGG CCTGCAGGAG CTCCTGGAGA GCAATCCGGA GCATCCGCGC TGGGACGAGG TCGCCGAACG CTGCCTGGCC TGCGCCAACT GCACCATGGT CTGTCCGACC TGCTTCTGCA CCACGCTGGA GGATCACAGC GACCTGTCGG GCGGCTCCGC CGAGCGGGTG CGGCTGTGGG ACTCCTGCTT CACCCTGGAC TTCTCCTACA TCCACGGCGG CAGCGTGCGG CAGACCGACA AGGGCCGCTA CCGCCAGTGG ATGACCCACA AGCTGGCCAC CTGGTTCGAC CAGTTCGGCA GTTCCGGTTG CGTCGGCTGC GGGCGCTGCA TCACCTGGTG CCCGGTGGGC ATCGATATTA CCGAAGAGGC GGCGGCCATC CGCGCCACAC CGCATATCCA GGGAGGGGAC CATGGATAG
|
Protein sequence | MDALPTATAA VLDRDGLQAL IEALAARGFR VCGPVVRDAA IVYDEISGVA DLPAGWTDRQ APGRYRLERR ADQALFGFAA VPQAWKRFLH PPVETLWRVR QGEDGMLFSS AVETAPRYAF LGVRACDLRA IAIQDRVFCA DDCRDDAYER RRRDAFIVAL NCSEAGDTCF CVSMGTGPRA EGAFDLALTE LLDASRHEFL VEVGSAAGEE LLAGLPRRTA TEPDRAAAAS VVARTAGRMG RSLETAGLQE LLESNPEHPR WDEVAERCLA CANCTMVCPT CFCTTLEDHS DLSGGSAERV RLWDSCFTLD FSYIHGGSVR QTDKGRYRQW MTHKLATWFD QFGSSGCVGC GRCITWCPVG IDITEEAAAI RATPHIQGGD HG
|
| |
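The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not translated; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table of this record.
start_bp = 414057
end_bp = 415205
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: both coordinates are inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed values, which is consistent with the gene sequence ending in the stop codon TAG.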