Gene Avin_30640 details


Gene Information

Locus tag: Avin_30640
Symbol:
ID: 7761964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3171942
End bp: 3173243
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 69%
IMG OID: 643805940
Product: HipA protein
Protein accession: YP_002800204
Protein GI: 226945131
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
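
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the coding sequence includes its stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3171942, 3173243)
print(length)                           # 1302
print(expected_protein_length(length))  # 433
```

Under those assumptions both values agree with the Gene Length and Protein Length listed in the record.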


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCGCA GGCTGCTGGC CTGGATCGAC CGCCAGCCGG TCGGCACCCT ACGCGACCGC 
GACGGCGTCT GGTCCTTCCA GTACGCGCCA GCATGGCTGG AAGCAGCCGA CAACTTCGCC
CTGTGCCCCG GCCTGCCCCT GCAAGCCGGA GAACACCGGG ATGGCGGTTC GCGACGGCCG
GTGCAGTGGT ACTTCGACAA CCTGCTGCCC GAGGAAGGCC AACGGTTGCT GCTGGCCGGC
ACCGCCAGAG TGGATGGCAG CGATGCGTTC GGCCTGCTCG GCCACTACGG CGCCGAATCC
GCCGGCTCGC TGACCCTGCT CCCAGCGGAC GGCGAACAAC GGGAAGGCGG CCTGCGGCCG
CTGAGCAATG CCGACCTGAG CGCGCGCATC GCCGCCATGC CGGGCGTCCC GCTGGTCGAG
GGCGCGCCCA AACGCATGTC GCTGGCCGGT GCCCAGCACA AGCTGGCGGT GGTGCTGCAG
GATGGCGAAC TGTTCGAGCC GTCCGGGCGG ATGCCTTCGA CGCATATCCT CAAGCCCGAC
CATCCGCACG ACTCCTACGC GCACTCGGTC GTCAACGAAT GGTTCACGAT GGCGCTGGCC
CGGCGCCTCG GTCTCGCGGT GCCGCGTGTG GAGCGCCGCT ACGTGCCCCA GCCGGTGTAC
CTGATCGAGC GTTTCGACCG CCAGCAGGAA ACCGGGGGCT GGCGCCGCCT ACACAGCATC
GACGCCTGCC AGATGCTGGG CTTGAGCGCA GCCTACAAAT ATCTCGAAGG TAGCGTGGCC
CGGCTGACCG AACTGGCCGG CGCCTGCCGC AGCCCCGCAG TGGCACGCAC GGCGCTGTTC
CAGTGGCTGG TGTTCAACCT GCTGGTCGGC AACACCGATG CGCACCTGAA GAACCTCGGC
TTCCTGGTGT CCCATGGCGG CATCCGACCG GCGCCTTTCT ACGACCTGAT CTGCACCGCC
GTGTACGACA CGTCGGCTTT CGACAATGGC CGCTGGCCGG CCGCCACGAC CCTGGCCTGG
CCGCTGGAAG GCCGCGCGCG TATCGCCGAG GTCGACCGCC GCTGCCTGCT GGAGGCAGGC
GAGACCATGC GGATCAAGCC GGCCACGGCC ACGCGCCTGA TGGATCGCCT GAGGAGCAGG
ATCGCCGACG AGGCACGGGC GCTCCATGCG CAGGTCGAGC GGGAAAATGC CGCGCTGATC
GCCCGTCGCC CGGAACTGGC GGCGACTTTC GCTGGCGAGA TGCGTTGCCT GCGGGCGATC
ATCCATGTGG TGATCGCGGA ACAGGTGGCG CGCCTGGGCT GA
 
Protein sequence
MERRLLAWID RQPVGTLRDR DGVWSFQYAP AWLEAADNFA LCPGLPLQAG EHRDGGSRRP 
VQWYFDNLLP EEGQRLLLAG TARVDGSDAF GLLGHYGAES AGSLTLLPAD GEQREGGLRP
LSNADLSARI AAMPGVPLVE GAPKRMSLAG AQHKLAVVLQ DGELFEPSGR MPSTHILKPD
HPHDSYAHSV VNEWFTMALA RRLGLAVPRV ERRYVPQPVY LIERFDRQQE TGGWRRLHSI
DACQMLGLSA AYKYLEGSVA RLTELAGACR SPAVARTALF QWLVFNLLVG NTDAHLKNLG
FLVSHGGIRP APFYDLICTA VYDTSAFDNG RWPAATTLAW PLEGRARIAE VDRRCLLEAG
ETMRIKPATA TRLMDRLRSR IADEARALHA QVERENAALI ARRPELAATF AGEMRCLRAI
IHVVIAEQVA RLG
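
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such blocks might be cleaned, translated, and checked for GC fraction; it is not the database's own pipeline. It uses only the Python standard library. The codon table is the standard elongation table, which translation table 11 shares (table 11 differs only in its permitted start codons, and this gene begins with ATG). All function names are hypothetical.

```python
# Standard codon table in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGGAGCGCA GGCTGCTGGC CTGGATCGAC CGCCAGCCGG TCGGCACCCT ACGCGACCGC"
print(translate(first_line))  # MERRLLAWIDRQPVGTLRDR
```

The printed residues match the first two blocks of the protein sequence. Passing the complete gene sequence to `gc_fraction` and `translate` would allow the same comparison against the listed GC content and the full protein.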