Gene Noca_4786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4786 
Symbol 
ID4595391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp104853 
End bp106181 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content71% 
IMG OID639772573 
ProductIS4 family transposase 
Protein accessionYP_919233 
Protein GI119714091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0321655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.753556 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATGGCGC TGGCCGAGCA GGCCGGCCTC TCCGAGCTGG TCGCCGACCG GGTCACCTTG 
GCCTCGCTCT CGCCGCGGGT GGCCTCGGCG GGGGTGAACC CGGCCGGGAA GGTGACCTCG
ATCATCGCGG GGATGGCCGC GGGCGCGGAC AACATCGACG AGTTGCAGGT GATCCGCTCC
GGCGGGATGA AGCGACTCTT CGACCAGGTG TACGCGCCGG CCACGCTGGG CCAGTTCCTG
CGCGAGTTCA CCCACGGCCA CACCTTGCAG CTGGCCTCGG TCGCCCGGGC CCACCTGGTC
CACCTGGCGG CCCGGACCAA CCTGCTGCCC GGCATCGAGT CCCAGGCCTA CGTCGACATC
GACTCGCTGC TGCGCCCGGT CTACGGGCAC GCCAAACAGG GCGCCAGCTT CGGGCACACC
AAGATCGCCG GCAAGCAGGT GCTCCGTAAG GGTCTCTCGC CGCTGGCGAC CACGATCAGC
ACCGCCCAAG GGGCTCCGGT GCTGGCCGGG ATCCGGCTGC GTGGCGGGAA GACCGGCTCT
GGCAAGGGCG CGGCCTCGAT GGTCCGCGAG GCGATCAAGA CTGCTCGCGA CTGCGGTGCC
ACCGGCGAGA TCCTGGTGCG TGGTGACTCC GCCTACGGCA ACAGCGCAGT CGTGGCCGCG
TGCCTGAAGG CAGGCGTCCG GTTCTCGCTC GTGCTCACCA AGAACCCGGC GGTGTCCGCC
GCAATCGGCT CCATCCCCGA GGACGCCTGG ACCCCGGTCA CCTACCCCGG AGCCGTGATC
GATCCGGACA CCGGGGAGCT GATCAGCGAC GCGCAGGTCG CCGAGGTCGA GTTCACCGCG
TTCGCCTCCA CCGAGCACCC GGTCACCGCC AGGCTGGTGG TGCGGCGGGT CCGCGACCGC
GCCAAGCTCG ACGAGCTGTT CCCGGTCTGG CGATACCACC CGTTCCTCAC CAACAGCACG
CAGCCGACCG TGCAGGCCGA CCTGATCCAC CGGCGGCACG CGATCATCGA GACCGTCTTC
GCCGACCTGA TCGACGGGCC CCTGGCGCAC ATGCCCTCGG GACGGTTCGC GTCCAACAGT
GCATGGGCGA TCTGCGCGAT GATCACCCAC AACCTGCTCC GCGCCGCCGA CACCCTCGAC
CCCCACGCCG CTGCACCCGC GCGAGGCGCG ACGCTGCGCC GCCAGATCAT CCACGTCCCA
GCCCGGCTCG CCCGCCCGCA ACGCCGCCAT GTGCTGCACC TGCCCGCGCA CTGGCCCTGG
GCGAACCGCT GGCTGCGGAT CTGGACCGGC GTGTTCAGCC CCGCCCAAGC GCCACCACGC
GCGGCCTGA
 
Protein sequence
MMALAEQAGL SELVADRVTL ASLSPRVASA GVNPAGKVTS IIAGMAAGAD NIDELQVIRS 
GGMKRLFDQV YAPATLGQFL REFTHGHTLQ LASVARAHLV HLAARTNLLP GIESQAYVDI
DSLLRPVYGH AKQGASFGHT KIAGKQVLRK GLSPLATTIS TAQGAPVLAG IRLRGGKTGS
GKGAASMVRE AIKTARDCGA TGEILVRGDS AYGNSAVVAA CLKAGVRFSL VLTKNPAVSA
AIGSIPEDAW TPVTYPGAVI DPDTGELISD AQVAEVEFTA FASTEHPVTA RLVVRRVRDR
AKLDELFPVW RYHPFLTNST QPTVQADLIH RRHAIIETVF ADLIDGPLAH MPSGRFASNS
AWAICAMITH NLLRAADTLD PHAAAPARGA TLRRQIIHVP ARLARPQRRH VLHLPAHWPW
ANRWLRIWTG VFSPAQAPPR AA