Gene Maqu_4200 details

Gene Information

Locus tag: Maqu_4200
Symbol:
ID: 4653504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinobacter aquaeolei VT8
Kingdom: Bacteria
Replicon accession: NC_008738
Strand:
Start bp: 96460
End bp: 98649
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 54%
IMG OID: 639809627
Product: DNA topoisomerase I
Protein accession: YP_956966
Protein GI: 120536908
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
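
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence listed under Sequence ends in TAG). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 96460
END_BP = 98649
GENE_LENGTH_BP = 2190
PROTEIN_LENGTH_AA = 729


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon is not translated


print(span_length(START_BP, END_BP))            # 2190
print(expected_protein_length(GENE_LENGTH_BP))  # 729
```

Both results agree with the Gene Length and Protein Length fields in the record.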


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGCAAA TACTGGTTAT CGTAGAGTCT CCGGCTAAGT CCAAGAAGAT TCAGACCATC 
TTGGGCTCGA ACTATCGTGT AGCTGCATCT GTTGGGCATA TGCGGGATTT ACCGCCCCGG
GACCTTGGTG TGGATTTGGA AACGCTCAAA CCCACTTATG TGGTATCCGA AGGTAAATCC
CAGGTCGTCA GCAATCTTAA ACGGCTTGCC GGAATGAGTG ACGAGGTAAT CCTTGCTACT
GACCCTGATC GAGAGGGGGA AGCGATTGCC TGGCATCTTA AAGTAGCGCT TCGGTTGCCT
GATGATGTGA AACGTGTGAG CTACCAGGAA ATCACCCGGG ATGCCATTAT GAAGGCCATG
GCCAATCCGG GGCGTATTGA TATGAAGTTG GTGGCTGCCC AGGAAAGCAG GCGCGTTCTG
GATCGTCTGA TTGGCTATCA GGTATCCCCT GCCCTTTCCA GTAAGGCGAA CCTTTCCCTC
AGTGCCGGGC GCGTGCAGTC CGTTGCGGTA AAGTTCGTGG TCGATCGTGA GCGCGAGATC
CAAGCATTTC AGCCTCGTGG CTACAACGTG TGCACGCTAA GACTGGCCGG CCATCCTAAT
GTATCTGCTT CTCTTGATAT GACTCCCTTC GTCCCAAAAG ATGAGCGGCT CTGGAAAGCC
TCGGATGCAC AGCCATTCGC AGGGCCGCAA CGAGTGAAAT TGGTGAAGGT CGAAAAGAAA
CCAAATGCTG TTAAGCCGAA GTCTGCTTTT ACAACAGTCG ATCTGCAGGG GGTTGCGGGG
AAAATATTTG GGCTCTCCGC AAAAGAGGTT ATGGCTGCCG CCCAAACCCT CTTCGAGCAA
GGTCTTATCA CCTATCATCG GACAGACTCA CCAAACCTGT CCGACGAGGG TATCGCCAAG
ATCCAATCCT ACCTTTCTGG TCAGGGTGTT CCGATTGCTG ATCAAGTGGT AAGGCACAAA
TCAAAAGGTG ATGCTCAGGA GGCCCACGAA GCCATACGGC CTACGGATGT GAGCGCCGCA
ACTGCTGGGC AAACTGATAC CGAAAAGAAT GTCTACTCCT TAATTCGGGA GCGGGCCATG
CTGAGTGTTA TGCCCAATGG CATAGATGCC GTCACTCAAT ATGTCTTCCA GTCCGAGCGT
CGAGTGCCAG GTTTAAGTGG TCGGCCGGTG AATCCCATGT ATATGGCAAA GGGAAGCGTT
GTTCAGGAAA AGGGGTGGCG CGCCTACGCG AAGCTTGAGC GAATCAAATC AAAAGACACA
CCCCTGCCGT TGCTTGAGCA AGGTCGCATC TATGACGGTT CCGTGGCGGC GGTCCAGAAA
ACGACAGAGC CACCGTCACG ATATAATGAG CAGACCTTAA TTAAGGCGCT GGAGGCGAAA
GGGATTGGGC GTCCTTCAAC CTACGCACAG ATCATGGAGA ACATTAAGAA TCGCTCTTAC
ATCGAGCCTA AACCCGGCTC TGGGAAGTCT CCCGCCTTTG TCCCGGGTAA GCTGGGATAC
TACATCGTTG ATGCCCTGTC TCGCTTTAGC TTTATGAGCT ACACCTATAC CCGTGCTGTC
GAGGCATCTC TGGACAAGGT GGCCCGAGGT TCAATGTCCT ACGTTGGGCT TGTCCGTCCC
GTTAGTGACC AACTTACCTC TGACATTGCA GAGCGTCTTG AGGCTGAGTC ATTAGCGCTG
AAAGGCCGCT GCCCGGGCTG CGATCAGCCC ATCATCCAAA AGCACCGCAA AGGTTCCGGC
CGTGGCAAGG GGCGGTCTCA AGGCTCGCGG GCGTTCTGGG TACACATTGA TGACGCTCAT
GCAGCTGCTT GCGTTCAGTA TCTCAACGAT GAGCATGATG CCCCAGTCCT TCCCCCTCCA
GAGGTAACAT CCCCTTGCCC TCAGTGTCAG GCAACTCTAA TCCGTCGTTA CAGCAAGACG
GGTAGTCGCT CACCGTACTG GGCACATGCC GAACGCGGGG ATGGAGATGC TTGCGGAGTA
AAATTCTTCC CCGATGTGGA CGGTCAGCCC GTAATCCCCG AACCAATCCC AGAGACTAAG
TGCGTAGATT GCGGCGGGAT CATGAAGAAA CGTAAAAACT CCAAGACGCA ACAGCCGGTA
TGGGTTCATG TGGCGAAAAA GCCTTCCTGC GGAAACTTAT TCATTGATGA CATTGATGGG
GTGCCGGCGA ATGCTGCGAA GAAAGCGTAG
 
Protein sequence
MGQILVIVES PAKSKKIQTI LGSNYRVAAS VGHMRDLPPR DLGVDLETLK PTYVVSEGKS 
QVVSNLKRLA GMSDEVILAT DPDREGEAIA WHLKVALRLP DDVKRVSYQE ITRDAIMKAM
ANPGRIDMKL VAAQESRRVL DRLIGYQVSP ALSSKANLSL SAGRVQSVAV KFVVDREREI
QAFQPRGYNV CTLRLAGHPN VSASLDMTPF VPKDERLWKA SDAQPFAGPQ RVKLVKVEKK
PNAVKPKSAF TTVDLQGVAG KIFGLSAKEV MAAAQTLFEQ GLITYHRTDS PNLSDEGIAK
IQSYLSGQGV PIADQVVRHK SKGDAQEAHE AIRPTDVSAA TAGQTDTEKN VYSLIRERAM
LSVMPNGIDA VTQYVFQSER RVPGLSGRPV NPMYMAKGSV VQEKGWRAYA KLERIKSKDT
PLPLLEQGRI YDGSVAAVQK TTEPPSRYNE QTLIKALEAK GIGRPSTYAQ IMENIKNRSY
IEPKPGSGKS PAFVPGKLGY YIVDALSRFS FMSYTYTRAV EASLDKVARG SMSYVGLVRP
VSDQLTSDIA ERLEAESLAL KGRCPGCDQP IIQKHRKGSG RGKGRSQGSR AFWVHIDDAH
AAACVQYLND EHDAPVLPPP EVTSPCPQCQ ATLIRRYSKT GSRSPYWAHA ERGDGDACGV
KFFPDVDGQP VIPEPIPETK CVDCGGIMKK RKNSKTQQPV WVHVAKKPSC GNLFIDDIDG
VPANAAKKA