Gene Avin_14470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_14470 
SymboltopA 
ID7760383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1427388 
End bp1429994 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content66% 
IMG OID643804345 
ProductDNA topoisomerase I 
Protein accessionYP_002798638 
Protein GI226943565 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAAAT CGCTGGTCAT CGTGGAATCC CCGGCCAAGG CCAAGACCAT CAACAAGTAT 
CTGGGCAGCC AGTACGTGGT GAAGTCGAGC ATCGGCCATA TCCGTGACCT GCCCACCAGC
GGCTCGGCCA GTACCGCCAA GGAGCCGGCC AAGCGCGGCA AGGGCGTGGC CGAAGGGCCG
GCGTTGTCGC CCAGGGACAA GGCCAAGCGT CAGTTGTTCG CGCGCATGGG GGTCGATCCC
GAGCACGGCT GGCAGGCCCA TTACGAAATC CTGCCGGGCA AGGAAAAGGT GGTCGACGAG
TTGCGCCGCC TGGCCCGGGA AGCCGACACC ATCTATCTCG CCACCGACCT GGACCGCGAA
GGGGAGGCCA TCGCCTGGCA CCTGCGCGAA TCCATCGGCG GCGACGAGGA GCGCTACAAG
CGCGTGGTGT TCAACGAAAT CACCAAGAAG GCGATCCAGG AGGCTTTCTC CCAGCCGGGC
GACCTGGATA TCAACCGGGT CAACGCGCAG CAGGCGCGGC GCTTCCTCGA CCGGGTGGTC
GGCTACATGG TTTCGCCGCT GCTCTGGCAG AAGATCGCTC GCGGCCTGTC CGCCGGCCGC
GTGCAGTCGG TGGCGGTGAA GCTGATCGTC GAGCGCGAGC GGGAGATCCG CGCCTTCATC
CCGGAGGAGT TCTGGGAGGT GCATGCCGAC CTGGGCACCG CCCGCGGCGA CAAGGTGCGC
TTCGAGGTGG CCAGGGAGCA GGGCGAGGTC TTCCGCCCGC TCAACGAGGC CCAGGCCATG
GCCGCGCTGG AGAAGCTCAA GGCCTCCAGC TACCAGGTGC TCAAGCGCGA GGACAAGCCG
ACCCGCAGCA AGCCCTCGGC GCCCTTCATC ACCTCGACCT TGCAGCAGGC GGCGAGCAAC
CGCCTCGGCT TCTCGGTGAA GAAGACCATG ATGATGGCTC AGCGCCTGTA CGAGGCCGGT
TACATCACCT ACATGCGGAC CGACTCGACC AACCTCTCCG CCGATGCCCT GGAGATGGCG
CGCGGCTTCA TCGACAGCGA GTTCGGCGGG AAGTACCTGC CGGCCAAGCC CAATGTCTAC
ACCAGCAAGG AAGGCGCCCA GGAGGCTCAC GAGGCGATCC GTCCCTCCGA CGTCAACCTG
CGGCCGAACC AGTTGGCCGG CATGGAGCGC GACGCCGAGC GCCTCTACGA GCTGATCTGG
CGCCAGTTCG TCGCCTGCCA GATGCCGCCG GCCGAATACC TGTCGACCAA CGTCAGCGTC
CAGGCCGGCG ACTTCGAGCT GCGCGCCAAG GGCCGCATCC TCAAGTTCGA CGGCTATACC
CGCGTGTTGC CGCAATTGGC CAAGCCCGGC GAGGACGACG TGCTGCCGGA GATGAGCGAG
GGCGAACTGC TCGATCTGCT CAAGCTCGAT CCCAGCCAGC ATTTCACCAA GCCGCCAGCG
CGCTACAGCG AGGCCAGCCT GGTCAAGGAG ATGGAAAAGC GCGGGATCGG CCGGCCGTCC
ACCTACGCGG CGATCATCTC GACCATCCAG GAGCGCGGCT ACGTGACCCT GCAGAATCGC
CGCTTCCATT CGGAGAAGAT GGGCGAGATC GTCACCGAGC GGCTCGGCGA GAGCTTCGCC
AACCTGATGG ACTACGGCTT CACCGCTAGC ATGGAGGAGC ATCTGGACGA CGTCGCCCAG
GGCGAGCGCG ACTGGAAGAA CCTGCTCGAC GAGTTCTACG GCGATTTCCG CAGGAAGCTG
GAGGCGGCCG AATCCAGCGA GGCCGGCATG CGCGCCAACC AGCCGACCCT GACCGACATT
CCCTGCCGCG AATGCGGCCG GCCGATGATG ATCCGCACCG CCTCGACCGG CGTGTTCCTC
GGCTGCTCGG GTTACAACCT GCCGCCCAAG GAACGCTGCA AAGCGACCGT CAACCTGATC
CCGGGCGACG AGATCGCCGC GGACGACGAG GGCGAATCCG AATCCCTGCT GCTGCGCCAC
AAGCGTCGCT GCCCGAAGTG CGGCACGGCG ATGGACGCCT ATCTGCTCGA CGAGCGGCAC
AAGCTGCACA TCTGCGGCAA CAATCCGGAC TGCCCCGGCT ACGAGATCGA GGAGGGCCAG
TACCGCATCA AGGGCTACGA GGGGCCGACC CTGGAGTGCG ACAAGTGCGG TAGCGAGATG
CAGCTCAAGA CCGGCCGCTT CGGCAAGTTC TTCGGCTGTA CCAACGCCGC CTGCAAGAAC
ACCCGCAAGC TGCTGAAGAA CGGCGAGCCG GCGCCGCCGA AGATGGATGC GGTGAAAATG
CCGGAGCTGC GCTGCGAGAA GGTCGACGAT GTCTACGTGC TGCGCGACGG CGCTTCCGGC
CTGTTCCTCG CCGCCAGCCA GTTCCCGAAG AACCGCGAGA CCCGTGCGCC GCTGGTCCTG
GAACTGTTGC CGCATCGGGA CGAAATCGAC CCGAAGTACC ACTTCCTGCT GGAAGCCCCT
AGCCACGACC CGGAAGGGCG CCCAGCGGTG ATCCGCTTCA GCCGCAAGAC CAAGGAGCAA
TACGTGCAGA GCGAGGTCGA GGGCAAGCCC AGCGGCTGGC GCGCCTTCCA TCGGGACGGC
CGCTGGGTGG TCGAGGACAA GCACTGA
 
Protein sequence
MGKSLVIVES PAKAKTINKY LGSQYVVKSS IGHIRDLPTS GSASTAKEPA KRGKGVAEGP 
ALSPRDKAKR QLFARMGVDP EHGWQAHYEI LPGKEKVVDE LRRLAREADT IYLATDLDRE
GEAIAWHLRE SIGGDEERYK RVVFNEITKK AIQEAFSQPG DLDINRVNAQ QARRFLDRVV
GYMVSPLLWQ KIARGLSAGR VQSVAVKLIV EREREIRAFI PEEFWEVHAD LGTARGDKVR
FEVAREQGEV FRPLNEAQAM AALEKLKASS YQVLKREDKP TRSKPSAPFI TSTLQQAASN
RLGFSVKKTM MMAQRLYEAG YITYMRTDST NLSADALEMA RGFIDSEFGG KYLPAKPNVY
TSKEGAQEAH EAIRPSDVNL RPNQLAGMER DAERLYELIW RQFVACQMPP AEYLSTNVSV
QAGDFELRAK GRILKFDGYT RVLPQLAKPG EDDVLPEMSE GELLDLLKLD PSQHFTKPPA
RYSEASLVKE MEKRGIGRPS TYAAIISTIQ ERGYVTLQNR RFHSEKMGEI VTERLGESFA
NLMDYGFTAS MEEHLDDVAQ GERDWKNLLD EFYGDFRRKL EAAESSEAGM RANQPTLTDI
PCRECGRPMM IRTASTGVFL GCSGYNLPPK ERCKATVNLI PGDEIAADDE GESESLLLRH
KRRCPKCGTA MDAYLLDERH KLHICGNNPD CPGYEIEEGQ YRIKGYEGPT LECDKCGSEM
QLKTGRFGKF FGCTNAACKN TRKLLKNGEP APPKMDAVKM PELRCEKVDD VYVLRDGASG
LFLAASQFPK NRETRAPLVL ELLPHRDEID PKYHFLLEAP SHDPEGRPAV IRFSRKTKEQ
YVQSEVEGKP SGWRAFHRDG RWVVEDKH