Gene Mvan_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3894 
Symbol 
ID4647363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4162852 
End bp4164348 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content68% 
IMG OID639807358 
Productintegrase catalytic subunit 
Protein accessionYP_954679 
Protein GI120404850 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.484903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCA TTTCCGCCTA CCAACAGCTC GGGTCGTATC GGGCTGCCGC AGCGGAGTGC 
GGCACCACCC ATCGCACCGT GAAGAGGGTC GTCGACAAGT TCGAAGCCGA CCAGGCCGGC
GTGCGGCCGC CACCGCGGGC CGAACGGGCA CACAATTACG ACACCGTCTC GGGACTGGTC
GCCGAACGCG TGGACAAATC GAAGGGCCGG ATCTCGGCCA AGCGGCTGCT CCCGAAAGCA
CGCGCCGCCG GATACAGAGG TTCCGACCGT AACTTCCGAC GCCTCGTCGC GGAAGCGAAA
GCACTGTGGC GCAGTGAGAA TCATCGAGGT CGCCGTCCCG CGGTATGGGA ACCGGGTGAG
TATCTGGTGA TCGACTGGGC CCAGGCCGCG CCGGGATTGT TCTTGTTCTG CGCGGTGCTG
GCGTTCTCCC GCTGGCGGTT CGTGCGTTTC GCCGCCAATG AGCGGGCCTC GACCACGCTG
GCGTTGATCG CCGAGGCGCT GGCCGCGGCG GGGGGTGTGC CCGCGAAGGT GCTGGCCGAC
CGGATGGCCT GCCTCAAAGG AGGCGTGGTG GCCAACGTCG TGGTCCCCAC CCCGGACTAC
GTCCGGTTGG CCGGTCATTA CGGGTTCGCC CCGGATTTCT GTCACGCGAG CGATCCGCAT
TTCAAGGGCA TCGTGGAGAA CCTGTGCGGC TACGCCCAGG ACGATCTGGC CGTGCCGCTG
CTGACCGAGG CCGCCGTCAC CGCAACACCG ATCACGCTGC GTGCTGCGAA CGCCGCCGCC
GTGGCGTGGT GCGCCGAAGT CAACACAAGA GTGCATTCGG AGATCCACGC GATCCCCGAC
GAACGGCTGA TCGTCGAACG CGAACTGCTG CAACCACTCC CATCACTGCG ACTGCAGATC
GGGGCAGCGT CGGTGCTGCG CAAGGTCGAC CGGCTGTCGT GTGTCCGGTA CGGGTCGGCC
CGCTACTCGG TGCCGATGCG GTTGATCGGC ACCGCCGTAG CCGTGGTCGT CGACCACGGC
GCGATCGTCC TACTCGAGCC CGCCACCGGG GCGATCGTGG CAGAACACGA ACTCCTCGCC
CCCGGCGCAG TGTCGATCCT CGACGAGCAC TACGACGGAC CCCGGCCGGC GCCCAGTCGT
GGACCCCGGC CGAAGACCAC GGTGGAGAAG CAGTTCTGCG AGCTCGGTGA GGATGCTCAG
GCGTTCCTCA TCGGTGCCGC GGCGATCGGT AACACCCGAC TCGGGTCGGA GTTGGAGGTC
CTACTCGCCC TGGGTGCCGC CCACGGCACC GACGCCCTGA CCGCCGCCCT GCACCGGGCG
GTGGCGTTCC GCCGGTTCCG GGCCGCCGAT GTGCGGTCCA TCCTGGCCGT CGGGACCGGA
GCACCGCAGC CCCGCCCGGC CGGGGACGCG CTGATCCTGG ACCTACCAGT GGCCCCGACT
CGATCCCTGG ACGCCTACAA GTTCACCCCG GCCGTTGACG GCGAGGTGAT CCAGTGA
 
Protein sequence
MDIISAYQQL GSYRAAAAEC GTTHRTVKRV VDKFEADQAG VRPPPRAERA HNYDTVSGLV 
AERVDKSKGR ISAKRLLPKA RAAGYRGSDR NFRRLVAEAK ALWRSENHRG RRPAVWEPGE
YLVIDWAQAA PGLFLFCAVL AFSRWRFVRF AANERASTTL ALIAEALAAA GGVPAKVLAD
RMACLKGGVV ANVVVPTPDY VRLAGHYGFA PDFCHASDPH FKGIVENLCG YAQDDLAVPL
LTEAAVTATP ITLRAANAAA VAWCAEVNTR VHSEIHAIPD ERLIVERELL QPLPSLRLQI
GAASVLRKVD RLSCVRYGSA RYSVPMRLIG TAVAVVVDHG AIVLLEPATG AIVAEHELLA
PGAVSILDEH YDGPRPAPSR GPRPKTTVEK QFCELGEDAQ AFLIGAAAIG NTRLGSELEV
LLALGAAHGT DALTAALHRA VAFRRFRAAD VRSILAVGTG APQPRPAGDA LILDLPVAPT
RSLDAYKFTP AVDGEVIQ