Gene Information |
Locus tag | Avin_09340 |
Symbol | |
ID | 7759882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 882596 |
End bp | 883795 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803846 |
Product | hypothetical protein |
Protein accession | YP_002798148 |
Protein GI | 226943075 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
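The length fields in the record are consistent with its coordinates: 883795 - 882596 + 1 = 1200 bp, and 1200 / 3 = 400 codons, one of which is the stop codon, leaving 399 aa. The sketch below reproduces that arithmetic. It assumes 1-based, inclusive coordinates and an unspliced CDS whose length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(882596, 883795)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

The strand field does not affect either value, since the length depends only on the coordinate span.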
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.035454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGGC CTTTCGTCAT TCTCAGTCAC GTCGCCACCC ATGCCGTCTG CGAAGGCTTC CTGCCTGCCG CGCAGCGGCG CGGGCATCCG CTGCTGCTCA TCACCGACCA TGCCCAGGGG CATTGGCGCC ATCTGGCCGA GGCCGGGGTT TCGGTCGACG GGCTGGAGAT CCTCGAATGC GACGTCTTCA ACCCGCTGGC GGTGATCGAG TCGCTGCACG CCCATGGGCA ACGCCCGCTG GCGGTGTTCT CCAACAGCGA TCACCTGCAG ACCAGTTGCG CCCTGGTGGC CGAAGCCTTC GACTGCCCCG GCAAGGACTG GCGGACCTGC CACGCGGCGA AGAACAAGGC GGAGATGCGC GAGCGCCTGC AGCGCCTCGG CCTTCCCGGC CCCTGGTTCC GGGTGCTGAC CCCCGGCGCG GCGCTGCCGG CGGACGCCCC CTTCCCGCTG GTGGCCAAGC CGCGCGAGGG CGTCGCCAGC CTGGACGTGC AGTTGTGCCG CGACGCCGGC GAACTGGCCG CCTACTGCGA AGCCTTCTGG CAGCGCCAGC CGAACCGCGC GCTGCTGCTC GAAGCCTACC TGGAAGGACC GCTGTTCACC CTGGAGACCC TCGGCGACGG CCAGCGCCTG CAGGCCATCG GCGGCTTCGA CGTGACCCTC TCGTCACCGC CGCACTTCGT CGAGCTGGAG GCGCGCTGGA ACGGCCCGCT GGGTCGGGCG GCACGTAACG CCGCGCTGGC CCAGGTCGCC GCCTTCGGGG TCGACTTCGG CGTCTGCCAC AGCGAATTCA TCCTCACCGC CGACGGCCCG GTGCTGGTGG AGATCAACTA CCGCAGCATC GGCGACGGCC GCGAATTCCT CCTCGACCGC CTGCTGCCGC AGGGCTGGTT CGAACGCATC CTCGACCTGC ACCTGGGCGG CACCCTGGCC GACAACCAAA GCAGCAGCGC CGAAGCCCTG GTGCACTACC TGGTCGCCCC GCGCGCCGGC CGTCTGCTGG CGGCCAGCCC GAGCTTTCGC GACGAAGGCG ACGGCCATTG GGTCGACTAC TGCGCGCTGC GCGAGGTCGG CGAAGAGATC CGCCTGAGCA ACTCGAACAA GGACTACCTG GGCGTGCTGC GCCTGATCGC CCCCGACGCG GCCGCCCTCG CGGCACGCTT CGATGCCGTG CGCAGCGACC TGCGCTGGGA GCTGGCATGA
|
Protein sequence | MQRPFVILSH VATHAVCEGF LPAAQRRGHP LLLITDHAQG HWRHLAEAGV SVDGLEILEC DVFNPLAVIE SLHAHGQRPL AVFSNSDHLQ TSCALVAEAF DCPGKDWRTC HAAKNKAEMR ERLQRLGLPG PWFRVLTPGA ALPADAPFPL VAKPREGVAS LDVQLCRDAG ELAAYCEAFW QRQPNRALLL EAYLEGPLFT LETLGDGQRL QAIGGFDVTL SSPPHFVELE ARWNGPLGRA ARNAALAQVA AFGVDFGVCH SEFILTADGP VLVEINYRSI GDGREFLLDR LLPQGWFERI LDLHLGGTLA DNQSSSAEAL VHYLVAPRAG RLLAASPSFR DEGDGHWVDY CALREVGEEI RLSNSNKDYL GVLRLIAPDA AALAARFDAV RSDLRWELA
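The gene sequence as listed begins with ATG and ends with TGA, so it can be read codon by codon in the orientation shown. The following is a minimal sketch of that translation, not the database's own pipeline. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the NCBI compact form (base order T, C, A, G).
# The amino acid assignments of table 11 match those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record can be
    pasted in directly. Trailing bases that do not fill a codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGCAAAGGC CTTTCGTCAT TCTCAGTCAC"))  # MQRPFVILSH
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` should give the listed 399-residue protein.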