Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49350 |
Symbol | |
ID | 7763790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4995682 |
End bp | 4998351 |
Gene Length | 2670 bp |
Protein Length | 889 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807771 |
Product | hypothetical protein |
Protein accession | YP_002802006 |
Protein GI | 226946933 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02608] delta-60 repeat domain [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAT CGACACTGAG CGGTAGCGTG CCGAACCCTA CGCCCCGCGC GGCCACCGGC AAGATGCTCG TGGATGTGGG CGGCCACGAC GATTACAGCT ACGCCATGGG CCTGCAGCCC GACGGCGGGA TTCTGCTGGC GGGCTACAGC TACGACCCCG GCCGCCGGAA CTACGACACC AGCCTCGTCC GCCTGAATGC CGACGGCACC CTCGACACCG GCTTCGGCGA CCTCGGCAAG GTCGTCGTCG ACATCGGCGG GAGCGACGGG GCGCGCGGCC TGTCCCTGCA AGCCGACGGC AAGATCCTCC TCGCCGGCCT GAGCGCGAAC GACTTCGGGG CGATCCGCCT GAACGCCGAC GGCAGCCTCG ACACCGGTTT CGGCGTCGGC GGCAAGGTCA CCGTCGATAT CGCCGGCAGT TACGACCAGG CCAATGCCCT GGCCGTGCAG CCCGACGGCA AGATCCTGCT GGCCGGCAAC GGCCACAACG GCGCGAACTA CGATTTCAGC CTGATCCGCC TGAACGCCGA CGGCTCGCTC GACACCGGCT TCGGCAATGG CGGCAAGGCC AGCTTCGACG ACGGCGGCCA TGAGTTCGGC TACGGCCTGG CCCTGCAGGC GGACGGCAAG ATCCTGATGG TCGGGCAGAG CGGCTCCGAC ATCGGGGTGA TCCGCCTGAA CGCCGACGGC AGCCTGGATG CCGGCTTCGG CGACGGCGGC AAGGCCATTC TCGATATCGG CGGAGAGATC GACATCGGCC GCAGTCTGAG CCTGCTGCCC GACGGCAAGA TCCTGGTCGG CGGCTTCAGC TACGACATCA GCGGCGATTA TTACCACTAC AACTTCAGCC TGATCCGCCT GAACGCCGAC GGCAGCCTCG ATACCGGCTT CGGCGACGGC GGCACGGCCA TCGTCGATAT CGCCGGCGGC AACGACCAGG GCCACAGCCT GGTCCTACAG GACGACGGCA AAATCCTCCT CGCCGGCTTC AGCTACCTGC CGGACAGGGG CGCTTACGAC TTCAGCGTGA TCCGCCTGAA CGCCGACGGC AGTCTGGATG CCGGCTTCGG CGGCGACGGC AAGGTCACCG TCGATATCGC GCGCGGCTAC GACGAGGGCT ACAGCCTCGC CGTGCAGCCC GACGGCCGGA TTCTGCTCTC GGGCCTCGGC AACAACTCTG CCAGCGGCCG CTACGACTTC AGCGTGATCC GCCTGAACGC CGACGGCACG CTCGACACGA ATTTCGGCGC CCTGGACGAT GGCGTCGACC TCGTCGAAGG CAGCGACGGC GACGACGAAC TGCTCGGCGA CAGCGCCCGC GAACTGCTCC TCGGCAAGGA CGGCGACGAC CGCCTGGACG GTGGCGCCGG CGACGACACC CTCGACGGCG GCGCCGGGCG CGACAGCCTG AGCGGCGGCG GCGGCGCCGA CCTGTTCCGC TTTTCGAGCC GCGAGGACAG CCACCGCACC GCCAGCGAAG GCTTCGCCGA CCGGATTCGC GACTTCGACC CCGCCGAGGA CCGCATCGAC CTCTCGGCGC TCGGCTTCAG CGGCCTGGGC GACGGTCGCA ACGGCACCCT GGCCGTGCAG GTGAACGGTG CCGGCACGCG CACCTACCTG AAGAGCTTCG AGGCGGATGC CGAAGGCCGG CGCTTCGAGG TCGTCCTGGA CGGCGACTAC GCGGGCCTGC TCGATGACGG CCAATTGATC TTCGCGCCGC CGCGCCTCGA AGGCAGCGCG GCCGACGACG ATCTGATCGG CAGCGCGGCG GCGGAAATCC TCACCGGCGC GGACGGCGAC GACCGCCTGC ACGGCGCCGG CGGCGACGAT CTGCTCGACG GCGGCGCCGG GCGCGACCGG CTGACCGGTG GCAGCGGCGC CGACCTGTTC CGCATCGCCA ACCGCGAGGA CAGCCACCGC ACCGCCAGCG AAAACCTCGC CGACCGGATT CGCGACTTCG ACCCCGACGA GGACCGCATC GACCTCTCGG CGCTCGGCTT CAGCGGCCTG GGCGACGGCC ACGGCGGCAC TCTGCTGCTG CAGGTGAGCG GCGAGCGCAC CTACCTGAAG AGCTTCGAGG CGGATGCCGA GGGACGGCGT TTCGAGATCG CCCTGGACGG CGACCTCGCC GGCCGGCTCG ACAGCGGCAA TCTGCTCTTC GCCGCCGCGC CCCTCGAAGG CAGCGCGGCC GACGACGATC TGATCGGCAG CGCGGCGGCG GAAATCCTCA CCGGCGCGGA CGGCGACGAC TACCTGCACG GCGGTGCCGG CGACGACATC CTCGACGGCG GCGCCGGACG CGATACCCTG GCCGGCGGCA GCGGCGCCGA TCTATTCCGC TTCTCGGCCC GCGAGGACAG CCATCGCACC TCCGGCGAAA GCTTCGCCGA TCGGATCCTC GACTTCGAGG CCGGTACGGA TCGCATCGAC CTCTCGGCGC TCGGCTTCAG CGGGCTGGGC GACGGTCGCG ACGGTACCCT GGCCGTGCAG GTGAACGACA CCGGCACCCT CACCTACCTG AAGAGCTTCG AGACGGATGC CGAGGGACGG CGTTTCGAGA TCGCCCTGGA GGGCGACTAC GGCGGGCAGT TGTCCGCCGA CGCCATCCTC TTCGCCGCGC CCAACCAACT GGAAGTCATC GGCAGCGTGC CGCCCGAACA GGTCGGCTGA
|
Protein sequence | MNTSTLSGSV PNPTPRAATG KMLVDVGGHD DYSYAMGLQP DGGILLAGYS YDPGRRNYDT SLVRLNADGT LDTGFGDLGK VVVDIGGSDG ARGLSLQADG KILLAGLSAN DFGAIRLNAD GSLDTGFGVG GKVTVDIAGS YDQANALAVQ PDGKILLAGN GHNGANYDFS LIRLNADGSL DTGFGNGGKA SFDDGGHEFG YGLALQADGK ILMVGQSGSD IGVIRLNADG SLDAGFGDGG KAILDIGGEI DIGRSLSLLP DGKILVGGFS YDISGDYYHY NFSLIRLNAD GSLDTGFGDG GTAIVDIAGG NDQGHSLVLQ DDGKILLAGF SYLPDRGAYD FSVIRLNADG SLDAGFGGDG KVTVDIARGY DEGYSLAVQP DGRILLSGLG NNSASGRYDF SVIRLNADGT LDTNFGALDD GVDLVEGSDG DDELLGDSAR ELLLGKDGDD RLDGGAGDDT LDGGAGRDSL SGGGGADLFR FSSREDSHRT ASEGFADRIR DFDPAEDRID LSALGFSGLG DGRNGTLAVQ VNGAGTRTYL KSFEADAEGR RFEVVLDGDY AGLLDDGQLI FAPPRLEGSA ADDDLIGSAA AEILTGADGD DRLHGAGGDD LLDGGAGRDR LTGGSGADLF RIANREDSHR TASENLADRI RDFDPDEDRI DLSALGFSGL GDGHGGTLLL QVSGERTYLK SFEADAEGRR FEIALDGDLA GRLDSGNLLF AAAPLEGSAA DDDLIGSAAA EILTGADGDD YLHGGAGDDI LDGGAGRDTL AGGSGADLFR FSAREDSHRT SGESFADRIL DFEAGTDRID LSALGFSGLG DGRDGTLAVQ VNDTGTLTYL KSFETDAEGR RFEIALEGDY GGQLSADAIL FAAPNQLEVI GSVPPEQVG
|
| |