Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_04320 |
Symbol | |
ID | 7759391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 406566 |
End bp | 409922 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803353 |
Product | transglutaminase domain protein |
Protein accession | YP_002797663 |
Protein GI | 226942590 |
COG category | [S] Function unknown |
COG ID | [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCGATCC ATGTCGCTTT GCACCATGTC ACCCACTATC GTTACGACCG GCTGGTCAAC GTGGGGCCGC AGATCGTCCG GCTGCGTCCG GCACCGCATA GCCGCACCCG CATCCTTTCG TATGCGCTGA AGGTGGCGCC GGGCGAGCAT TTCATCAACT GGCAGCAGGA CCCGCAGGGC AATTACCTGG CGCGTCTGGT GTTCCTGGAG AAGACCCGCG AGCTGAAGGT CGAGGTCGAC CTGGTCGCCG AGATGGCGGT GTTCAACCCC TTTGACTTCT TCCTCGAGCC CTACGCCGAA ACCATTCCCT TCGATTACAC CGAGGGCGAG CGGCGCGAGC TGGCGCCCTA CCGGGTGACG CTGCCGGCGA CGCCGCTGTT CGCCCGCTAT CTGGCCGGCA TCGAGCGGAA GCCGACCCGC AGCGTCGATT TCCTGGTCGA CCTGAACCAG CGCGTGGCGC GCGACGTGCG CTACCTGATC CGCCTGGAGC CCGGCGTGCA GAGCCCCGAG GAGACCCTGG AGAAGGCCTC CGGCTCCTGC CGCGACTCGG CCTGGCTGCT GGTGCAACTG CTGCGCCACC TGGGCCTGGC GGCGCGCTTC GTTTCCGGCT ACCTGATCCA GCTCAAGCCC GACGTGAAGT CGCTCGACGG TCCCGGCGGC GCCGAGGTGG ACTTCACCGA CCTGCACGCC TGGTGCGAGG TGTACCTGCC GGGCGCCGGC TGGATCGGCC TCGACCCGAC CTCCGGGCTG TTCGCCGGCG AGGGGCACAT CCCGCTGGCG TGCAGCCCCG AGCCGTCCTC GGCGGCGCCG ATCAGCGGGC TGGTCGACGA CTGCGAATGC GAGTTCTCCC ACGACATGCG CATCGAGCGT ATCTGGGAGG CGCCGCGGGT CACCCGGCCC TACAGCGAGG AACAGTGGCG GGCGATCCTC GACCTCGGCC ACCGGGTCGA CGCCGACCTG CTGCGCGGCG ACGTGCGCCT GACCATGGGC GGCGAGCCGA CCTTCATCGC CCTCGACTAC CCGGACGACC CCGAGTGGAA CACCGAGGCC ATGGGGCCGA ACAAGCGCCG CCTGGCCGCC GACCTGTTCC ATCGCCTGCG CGCGCATTAC GCGCCGCAGG GCCTGGCGCA CTTCGGCCAG GGCAAGTGGT ATCCGGGCGA GCAACTGCCG CGCTGGTCGC TCAATTGCTT CTGGCGCAAG GACGGCGAGC CGGTCTGGCA GGACCCGGCG CTGTATGCCG ACGAGACCCG TCACTACGGG GCCGACGCGC AGTTGGGCGC GCGCTTCCTC GACATCCTCT GCGGCTATCT CGGAATTTCC GGCGAGCATT TCTTCCCGGC CTACGAGGAC TGGCTGTACT ACCTCTGGCG CGAACGTCGC CTGCCGGAGA ACGTCACCCC CGAGGACGCG CGGCTGGCCG ATCCACTGGA GCGCGAGCGT CTGCGCCGGG TCTTCCGGCA GGGCCTGGAG CATGCGGTCG GACACGTCCT GCCGCTGATG CGCAGCCTCG ACGGCAGCCA CTGGCTGACC GGCGCCTGGT TCCTGCGCGA CGAATACTGC CGGCTGACCC CCGGCGATTC GCCCCTGGGC TGGCGCCTGC CGCTGGATTC GCTGCCCTGG GCCCGCGACA TGGATCAGCC CTACGTGCAT GCGCCGGATC CCAACCAGTC CTTCCCGCCG TTGCCGAGCC GTCAGCAGAT CCTCCGGCAA CTGCGCGGCA CGCCGCCCGC CCGGCCGGCC GCGACGGGCG AAGGGCCGGG CTCCGGCGCT TCCCCGCGGA GCCTCGGCGC TACGGGCGAA GGCTTGCGCC AGGCGACGGC GCCGGCGCCC TTCGAGTCGG CCGCCGGCAT CGTCCGCACC GCGCTCTGCG TCGAGCCGCG CAACGGCCGT CTGTACCTCT TCATGCCGCC GCTGGAGCGG CTGGAGGACT ATTTGGAGCT GGTCGCGGCC ATCGAGACCA CGTCCCGCGA GCTGGACTGC CCGGTGCTGC TGGAGGGCTA CGAGCCGCCG GCCGACCCGC GCCTGCGGCA TTTCCGCGTC ACCCCCGATC CGGGCGTGAT CGAGGTGAAC ATCCACCCGG CGGAGAACTG GGACGAACTG GTCGAGCGCA CCGAGTTCCT CTACGAGGCG GCGCGCCAGT CGCGACTGAC CAGCGAGAAG TTCATGATCG ATGGCCGCCA CGTCGGCACC GGCGGCGGCA ACCACTTCGT CCTCGGCGGC GCGACCCCGG CCGACTCGCC GTTCCTGCGC CGGCCGGATC TGTTGCGCAG CCTGATCGGC TACTGGCACA ACCACCCGTC GCTGTCCTAC CTGTTCTCCG GCCTGTTCAT CGGCCCGACC TCGCAGGCGC CGCGGGTGGA CGAGGCGCGC AACGACTCGC TCTACGAACT GGAGATCGCC TTCCGGCAGA TGCCCGAGCC GGGCACCGAC TGCCCGCCGT GGCTGGTCGA CCGTCTGCTG CGCAACCTCT TGGTGGATAT CACCGGCAAC ACCCACCGCG CCGAGTTCTG CATCGACAAG CTGTATTCGC CGGACAGCGC CAGCGGCCGC CTCGGCCTGC TCGAATTCCG CGCCTTCGAG ATGCCGCCGC ACGCGCAGAT GAGCCTGGCC CAGCAGGTGC TGCTGCGCTC CTTGATCGCG CGCTTCTGGG ACGAGCCGTA CCGGCCGCAG CGCCTGGTGC GCTGGGGCAC CGAACTGCAC GACCGCTTCA TGCTGCCGCA CTTCGTCGCC CAGGACTTCC ACGACGTGCT CTACGAGATG GACCGGCACG GCTATCCGTT GCGGCCGGAG TGGTTCGCCC CGCATTTCGA GTTCCGCTTC CCGAAACTCG GCGACTTCGC CGTGCAGGGC ATGGCGCTGG AGCTGCGTCA GGCGCTGGAG CCCTGGCATG TGATGGGCGA GGAAGGCGCC GTCGGCGGCA CCGCGCGCTA CGTGGACTCT TCCCTGGAGC GCCTGCAGGT GAGCGTGAGC GGCATGGCCC AGGACCGCTA CGTGTTGACC TGCAACGGCC AGCCGGTGCC GCTGCGCCCG ACCGGCACGG TCGGCGAGTT CGTCGGCGGC GTGCGCTATC GCGCCTGGCA GCCGGCTTCC AGCCTGCACC CGACCATCGG CGTGCACGCG CCGCTGGTGT TCGACTTGGT GGATACCTGG ATGCAGCGTT CCCTAGGCGG TTGCCAGTAC CATGTCAGCC ATCCGGGCGG GCTCAGCTAC GAGACCCTGC CGGTCAACGC CTACGAAGCG GAAAGCCGCC GTCTGTCGCG CTTCTTCCGT ATCGGCCACA CACCGGGCAA ACTGGTGCCC GCCGAGCCGG TCGAGAACAG GGAGATGCCG ATGACCCTGG ACCTGCGCCG CTGCTGA
|
Protein sequence | MSIHVALHHV THYRYDRLVN VGPQIVRLRP APHSRTRILS YALKVAPGEH FINWQQDPQG NYLARLVFLE KTRELKVEVD LVAEMAVFNP FDFFLEPYAE TIPFDYTEGE RRELAPYRVT LPATPLFARY LAGIERKPTR SVDFLVDLNQ RVARDVRYLI RLEPGVQSPE ETLEKASGSC RDSAWLLVQL LRHLGLAARF VSGYLIQLKP DVKSLDGPGG AEVDFTDLHA WCEVYLPGAG WIGLDPTSGL FAGEGHIPLA CSPEPSSAAP ISGLVDDCEC EFSHDMRIER IWEAPRVTRP YSEEQWRAIL DLGHRVDADL LRGDVRLTMG GEPTFIALDY PDDPEWNTEA MGPNKRRLAA DLFHRLRAHY APQGLAHFGQ GKWYPGEQLP RWSLNCFWRK DGEPVWQDPA LYADETRHYG ADAQLGARFL DILCGYLGIS GEHFFPAYED WLYYLWRERR LPENVTPEDA RLADPLERER LRRVFRQGLE HAVGHVLPLM RSLDGSHWLT GAWFLRDEYC RLTPGDSPLG WRLPLDSLPW ARDMDQPYVH APDPNQSFPP LPSRQQILRQ LRGTPPARPA ATGEGPGSGA SPRSLGATGE GLRQATAPAP FESAAGIVRT ALCVEPRNGR LYLFMPPLER LEDYLELVAA IETTSRELDC PVLLEGYEPP ADPRLRHFRV TPDPGVIEVN IHPAENWDEL VERTEFLYEA ARQSRLTSEK FMIDGRHVGT GGGNHFVLGG ATPADSPFLR RPDLLRSLIG YWHNHPSLSY LFSGLFIGPT SQAPRVDEAR NDSLYELEIA FRQMPEPGTD CPPWLVDRLL RNLLVDITGN THRAEFCIDK LYSPDSASGR LGLLEFRAFE MPPHAQMSLA QQVLLRSLIA RFWDEPYRPQ RLVRWGTELH DRFMLPHFVA QDFHDVLYEM DRHGYPLRPE WFAPHFEFRF PKLGDFAVQG MALELRQALE PWHVMGEEGA VGGTARYVDS SLERLQVSVS GMAQDRYVLT CNGQPVPLRP TGTVGEFVGG VRYRAWQPAS SLHPTIGVHA PLVFDLVDTW MQRSLGGCQY HVSHPGGLSY ETLPVNAYEA ESRRLSRFFR IGHTPGKLVP AEPVENREMP MTLDLRRC
|
| |