Gene Avin_50870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50870 
Symbol 
ID7763935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5158418 
End bp5160424 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content66% 
IMG OID643807915 
ProductRhs element Vgr protein 
Protein accessionYP_002802149 
Protein GI226947076 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGTC CCACCGACAT TACTACCAGC CTTTCTCTGA CCACTTCGGC CCTGGCGAAT 
CTGTTTCCGG AGCAGCTCTC CGGCGAGGAG CGGCTGAACG GCCTCGGTAG CCTGCAGTTG
CACAGCTATA GCGCCGCCGC ACCGACGCTG GACAGCGTGG TGGCTACGCA CCTGACCGCG
ACGCTGCACA ACGATGCCGA CTTGCGCCCG CTGGACGCGC TAATCGCGGA AGTCCGCCAA
TTGCCCGGCG ACGCGAGCGC CGACCGTTAC CAAGTACTCC TGCGCCCTTG GCTCTGGTGG
CTGACCCTGG CCAGCAACAA CCGGGTGTTC CAGAACAAGA CCACGGGCGA GATCGTCACC
GGCATCTTCG ATGGCCATGG CTTCGGCGAT TACCGGTTGA AGCTCTCCGG CAGCTACACG
CCGCGCGAGT ACTGCGTGCA GTACAGCGAG ACGGATTTCG CCTTCGTCTC GCGGCTGCTG
GAGGAGGAAG GCATCTTCTG GTTCTTCACC CATGAAGAGG GCAGGCACAC CCTGGTGCTG
GCCGACGGCA ACGATGCCTT CCCAGCGATC CCCAACGGGC CGAAAGTACC CTATCTGAGC
CAGGAAATCG GCGTGCGCGA GCTGCACGGC GTGCGTTCGG CGCAGTACTG CATCCAGGCG
GTGGCCGGGG CCTACAGTGC GACCGACTAC GAATTCACCA CGCCCACTAC CTCGCTCTAC
AGCCGGGCCG AGGCGCTGGC CGGGTCATTT TCCAGCTACG AGCATCCCGG CGGCTACATC
GCCAAGGCGC GCGGCGACGC GCTGACCAAG CAGCGGGTGG ACGGCCTGCG CAGCCAGGAA
AAGCGCCTGA TCGGCGAGAG CGACTGCCGC TGGCTGGTGC CGGGGCACTG GTTCACCCTC
ACCGGACACG ACGACGCCAA CCTGAACATC GACTGGGTGG TGACCTCGGT GACCCACGAG
GCCAGCCACG AGCACTACCG CAACCGCTTC GAGGCGATCC CGAAGGCCAC CAACTACCGT
CCCCCGCGCG CGACGCCCAA GCCACGCATG CACACCCAGA CGGCCAGGGT GGTCGGCAAG
GCCGGCGAGG AGATCTGGAC CGACCAGTAC GGCCGGATCA AGATCCAGTT CCCCTGGGAC
CGCGAGGGGC GGAACGACGA GACCAGTTCC TGCTGGGTGC GCGTGGTATT GCCCTGGAGC
GGCAAGAACT TCGGCATGCA GTTCGTTCCG CGCATCGGCC AGGAGGTGAT CGTCACCTTC
ATCGACGGCG ATCCGGACCG GCCGCTGGTC ACCGGCTGCG TCTACAACGG CGACAACGCC
CTGCCCTACG CGCTGCCGGC CAACCAGACG CAATCGGGCA TCAAGACCAA TTCGTCCAAG
GGCGGCGGCG GTTTCAACGA GCTGCGCTTC GAGGACAAGA AGGATGCCGA GGAGGTCTTC
CTGCAGGCGC AGAAGGACCT GAAGATCAAC GTGCTCAACG ACAGCACGGC CAGCGTCGGC
CACGACGAGA CGCTGACGGT GCAGAACGCC CGCACCCGCA CGGTGAAGGA CGGCGACGAG
ACCGTCACCC TGGAAAAGGG CAACCGTAGC GTCACCCTGC AGACCGGCAG CGATACCCTG
GACGTCAGGG ACAGCCGTAC GGTGAAGGTC GGCAGCGACC AGACCCACAG CACCGGCGGC
GACTACAGCC ACACGGTGAC CGGCGACTAC AGCCTGACGG TGAACGGCAA TCTGACCATC
AAGGTGAGCG GCACCCTGAC CCTGCAGAGC AGCGGCGACT TCACCGCCAG GAGCGACCTG
TCGCTGACCC AGCAGGCCGG TACCTCGATC AGCCAGAAGG CCGGCACCTC CTTCGCCAAC
CAGGCCGGTA CCTCGCTGGA CAACAAGGCC GGCACCACCC TGGTCAACGA CGCCGGCATC
AGCCTGACCA ACAAGGCCGG CGCCGAGCAG ACGGTGGATG GCGGCGGCCT GCTGACCCTC
AAGGGCGGCC TGGTCAAGAT CAACTGA
 
Protein sequence
MPRPTDITTS LSLTTSALAN LFPEQLSGEE RLNGLGSLQL HSYSAAAPTL DSVVATHLTA 
TLHNDADLRP LDALIAEVRQ LPGDASADRY QVLLRPWLWW LTLASNNRVF QNKTTGEIVT
GIFDGHGFGD YRLKLSGSYT PREYCVQYSE TDFAFVSRLL EEEGIFWFFT HEEGRHTLVL
ADGNDAFPAI PNGPKVPYLS QEIGVRELHG VRSAQYCIQA VAGAYSATDY EFTTPTTSLY
SRAEALAGSF SSYEHPGGYI AKARGDALTK QRVDGLRSQE KRLIGESDCR WLVPGHWFTL
TGHDDANLNI DWVVTSVTHE ASHEHYRNRF EAIPKATNYR PPRATPKPRM HTQTARVVGK
AGEEIWTDQY GRIKIQFPWD REGRNDETSS CWVRVVLPWS GKNFGMQFVP RIGQEVIVTF
IDGDPDRPLV TGCVYNGDNA LPYALPANQT QSGIKTNSSK GGGGFNELRF EDKKDAEEVF
LQAQKDLKIN VLNDSTASVG HDETLTVQNA RTRTVKDGDE TVTLEKGNRS VTLQTGSDTL
DVRDSRTVKV GSDQTHSTGG DYSHTVTGDY SLTVNGNLTI KVSGTLTLQS SGDFTARSDL
SLTQQAGTSI SQKAGTSFAN QAGTSLDNKA GTTLVNDAGI SLTNKAGAEQ TVDGGGLLTL
KGGLVKIN