Gene Avin_37360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_37360 
Symboltmp 
ID7762629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3786701 
End bp3789937 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content67% 
IMG OID643806603 
ProductPhage TMP domain-containing protein 
Protein accessionYP_002800856 
Protein GI226945783 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGTG ATCTGAGTTT GCAAGTCCGC CTGAGCGCGA TAGACCGGAT CACCGCGCCG 
TTGCGCCGCA TCGTCCAGGG TAGCGGCGCC CTGGCCCAGG CGATGAAGGC CAGCCAGGAC
CAGCTCAAGG CCTTGAATCA GCAGCAGCGC GACTTGAGCG GCTATCGCCA GACGAATGTC
GAGATCGCCC GGCAGACCAA GGCTATCCAG GCACTGCAGG CCAGGACCCG CGAGCACACC
CAACTGCTGG AAAAGCAGCG GGCCGTGCAT GTCAACCTCA AGGGCAACCT GAAGGCGGCC
CAGACGCAGT ACAACAAGCT GGCCAAGGCA CTGATCGAGG GCAAGGGCGA GACGGCCAGC
TTCCACTTCG AGCTGGAGAA GGCCCAGATC AAGCTGCAAT CCGCCCAGCA GGCCTTCAAC
CGCTCGTCCA GCACGATCAA GACCTACAAG GACCGTATTC GCCAGGCTGA CAGTCAGCTT
GCCCAACTCG GCAGCCAGCA GCAGAACAGC CAGGAACGTC TGGCCGGCTA CAAGCGGCGG
CTCGACGAGG CCGGCATCGG GACGGAGCGC CTGGGCAGTC GGGCCCGGCA ACTGCGCGGC
GAACAGGAAC GCCTCAACGC CGTGCTCGAG GCGCAGAAAG CCCGGCTCGC TGCCGTCACC
GCGCAGCAGG AGCGGCTGAC CAAGGCACAG AAGAGCTACG AACGCGCCCA GGCCGTGGCC
GGCAAGATCG CCATCGGCGG GGCTGCCAGC TTGGCGAACG GCTACGCGCT GTCCCGCCCG
CTGTCTGCCG TGATGGACGC CTACGCGCCA GCCGAGGACG CCGCAGCCCA GTTGCGCGCT
TCGATGATGG GTGCCGATGG CAGCGTCTCG GCGGACTTCG AAAAGATCAG CGCCTTGGCC
ACCCGCTTGG GCGACCGCCT GCCCGGGACC ACCGCGCAGT TCCAGGAAAT GATGACCATG
TTGCGCCGGC AGGGCATCAG CGCACAGTCG ATCCTCGGCG GTACCGGCGA GGCGGCGGCC
TACCTGGCGG TCCAGCTCAA GATGGGCAGC AGCGAGGCGG CCGAGTTCGC GGCCAAGATG
CAGGACTCCA CGCGCACGAC TGAGGCTGAC ATGATGGGGC TGATGGACAC GATCCAGCGC
ACCTTCTATC TCGGCGTCGA TCCGACCAAC ATGCTCCAAG GGTTCGCCTC GATCTCGCCA
GCGCTGTCGA TGATCCGCAA AAGTGGGCTG GAGGCAGCCA ACACCCTGGC CCCGCTACTG
GTCATGATGG ATCAGGCCGG TATGTCCGGA GAGTCGGCCG GCAACGCCCT GCGCAACGTG
TTCCAGTCTG GATTCAAGAC TGACAAGGTT GCCAAGGCCA ACAAGATGCT GAAGAAGCTC
GGCATCAGCC TCGACTTCAC GGACGGAAAA GGCGAGTTCG GCGGACTGGA GAAGCTGTTC
GCCCAGTTGC AGAAGCTCCA GAAGCTGACC ACGGAAAAGC GCACCGCCGT CATCAGCGAG
ATATTCGGCG ACGACTCGCA GAACCTGCAG GTGCTCAATA CCTTGATTGA CAAGGGCCTG
GATGGCTACC GCGAAGTCGA GGCCAAGATG AAGGCCCAGG CCGACCTGCG CAAGCGCGTC
GACGACCAGC TCAAGACCCT GACCAATGTC ATCGACGCAG CCCAAGGCAG TTGGACCAAC
GCCATGGCTG AGTTCGGCGC AGCCGTGGCT CCGGAACTGA AGGGCTTGAT CCAGTGGCTC
GGCAACGTCG CCAGTGGCAT TGGCGCCTGG GCGCGGGAAA ATCCGCAACT GGCCGGGACG
CTGGTCAAGG TGACAGCAGG CATCGGGGCT CTGGCGGCGG CTGGTGGTGC GCTGGCCATC
GGCATGGCCG GGTTGATCGG GCCGTTCGCC ATGGCCAAGC TCGGCCTCAG CGTCTTCGGC
ATCCAGGCCG GTAGCGCCAT GGCGAGCACG GGGCTGTTGG GCAAGGCCCT GGGAGGACTT
TCGACCAGCC TGTCGGGGCT GGGCGCGGCC TGGCAAGCCG CCTCGCTCGG GACCGTCCTC
ACCGCGCTAC CGGGACGCCT CAAGGCCGCC GCCAGCGCGG CCAAGGCCTG GGTGGCCAGC
GCCGGTAGCG CCCTGGTCGG CAGTTTCCGG GCGGCCGGCA GCTCCGCCCT GGCATTCGCC
ACCGCGCCCT TGCGGTGGGT GATCAAGGGA CTGCGCGAGG CCGCGCTCGC GGCGTGGATG
AATATCCGCG TCAACGGCTT GCTCGGCGCC AGTTGGAACG GCATCAAGGC GGGGGCCGGT
GGCCTGCTGG CGGTGCTGCG CGGCGGCTTC TCCGCCGTGC TCGGCGGTGC CGCCGGCGCC
TTGCGTCTGT TCGGGCAGGC CATCGTGTTC GCGGGACGTG CCCTGCTGCT CAACCCCATC
GGCTTGACTA TCTCGGCGCT GGCCCTGGCC GGCATGGCCT TGCTCAAGTA TTGGCAGCCG
GTCAAGGCCT TTTTCGGCGG CTTCTGGCAA GGCTTCACCC AGGGGCTCGA ACCGCTGGCC
CCGGCCTTCG CGGCGCTGGG GAGCGCCTTG GCGCCGCTCA AGCCGCTCTG GGACGGTATC
GCCGGAGCCA TGTCGGCCGC GTGGCAGTGG GTCAGCCGGC TGTTCGCCCC GTTCCAGGCG
ACGGCCACCG AACTCCAGAA TGCCACCAGC CGGGGCCAGG CCTTCGGCCT CTGGCTGGCC
GGGCTGGTCA ACTCCCTGAC AGCCATCGCC GGCAAGATGT TCGGCTTCGG GGTCGACATT
GTGAAAGGGC TGATCAACGG CATTCTCAGC ATGAAGAATA CCGTCCTCGG TGTTATCGGT
GGTATTGGCA GCAGCATCAC CGGCTTGTTC TCGAAGGATC AGGAGATCCA CAGCCCGAGC
CGCGTATGGG CCCAGTTGGG CAACTACACG ATGCAGGGCC TCGAACAGGG CCTGCTGAAG
GGGCAGGGCG GGCCGCTCGG CGCCATCGCC GATCTGAGTC GGCAGCTCAC CCAGGCCGGC
GCGCTCACCG TGGGCCTGGG CGCGGCCGGC GGTGCGCTGG CGATCGACAA CCGCCCGCCG
CTGGCCGCTG GCGGTGCGGC GCCCATCGTC GTCCAGGGCG ACACCATCAC GATCAACCTC
CAGGTTGGTG CCGGCGGCAA CCCGGCCGAC CTGGCGCAAC AGATCAACCG CATCCTCGAC
GAGCGCGAGC GGGCCAAGGC CGCCCGCGTG CGCTCCCGCC TGCACGATCA GGAGTGA
 
Protein sequence
MARDLSLQVR LSAIDRITAP LRRIVQGSGA LAQAMKASQD QLKALNQQQR DLSGYRQTNV 
EIARQTKAIQ ALQARTREHT QLLEKQRAVH VNLKGNLKAA QTQYNKLAKA LIEGKGETAS
FHFELEKAQI KLQSAQQAFN RSSSTIKTYK DRIRQADSQL AQLGSQQQNS QERLAGYKRR
LDEAGIGTER LGSRARQLRG EQERLNAVLE AQKARLAAVT AQQERLTKAQ KSYERAQAVA
GKIAIGGAAS LANGYALSRP LSAVMDAYAP AEDAAAQLRA SMMGADGSVS ADFEKISALA
TRLGDRLPGT TAQFQEMMTM LRRQGISAQS ILGGTGEAAA YLAVQLKMGS SEAAEFAAKM
QDSTRTTEAD MMGLMDTIQR TFYLGVDPTN MLQGFASISP ALSMIRKSGL EAANTLAPLL
VMMDQAGMSG ESAGNALRNV FQSGFKTDKV AKANKMLKKL GISLDFTDGK GEFGGLEKLF
AQLQKLQKLT TEKRTAVISE IFGDDSQNLQ VLNTLIDKGL DGYREVEAKM KAQADLRKRV
DDQLKTLTNV IDAAQGSWTN AMAEFGAAVA PELKGLIQWL GNVASGIGAW ARENPQLAGT
LVKVTAGIGA LAAAGGALAI GMAGLIGPFA MAKLGLSVFG IQAGSAMAST GLLGKALGGL
STSLSGLGAA WQAASLGTVL TALPGRLKAA ASAAKAWVAS AGSALVGSFR AAGSSALAFA
TAPLRWVIKG LREAALAAWM NIRVNGLLGA SWNGIKAGAG GLLAVLRGGF SAVLGGAAGA
LRLFGQAIVF AGRALLLNPI GLTISALALA GMALLKYWQP VKAFFGGFWQ GFTQGLEPLA
PAFAALGSAL APLKPLWDGI AGAMSAAWQW VSRLFAPFQA TATELQNATS RGQAFGLWLA
GLVNSLTAIA GKMFGFGVDI VKGLINGILS MKNTVLGVIG GIGSSITGLF SKDQEIHSPS
RVWAQLGNYT MQGLEQGLLK GQGGPLGAIA DLSRQLTQAG ALTVGLGAAG GALAIDNRPP
LAAGGAAPIV VQGDTITINL QVGAGGNPAD LAQQINRILD ERERAKAARV RSRLHDQE