Gene Noca_4886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4886 
Symbol 
ID4595269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp217946 
End bp220069 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content71% 
IMG OID639772671 
Productphage integrase family protein 
Protein accessionYP_919331 
Protein GI119714189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.717526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACCTG CACTGACGAG CGACGAGGGC GACCGTGGCC GTGTCGGGGC ATCGATCCTG 
TACCACCGCA CCGTGCGGGC CGGCACCACG CCGAGCAAGC TGTCCCGCTT CGAGGACACG
GTCTGGCACC TGGCCCCCGC CCACCCCGAC GCGCACGCGA AGATCAACGC GATCCGATGG
GACCACTGGC CCGCCGAGCT GGTCGAGGTG TTCAAGACCG TCGCCCTGGC CTTCCTGGAA
CACCCCGTCC CGCGCAGCGT CACCGTCACC AGCGACGGGG AACCGATGAG CATCGGCACC
CTGGTCTTCC GGCTGCGCAC CCTGCACGTG TTCGCGGCCT GGATGAGCCA GCACCACCTG
CCCAGCCTGC ACGAGGTCAC CGACCAGCAT CTGGAACGCT ACCGCCGGCA CGTGCTCGGG
CTGGAGACCA GCAACCGCCG CAAACGTGAC CTGTTCATCG CGGTCCGCAC CGTCTGGGAC
TACCGTGCCT ACCTTCCCCC GCACTGCCGG CTGGACACCG ACAACCCCTG GGACGGCACG
CCGCCATCCA GGCTGGCCGC GGCCCCCTGC CGGCCGGCGG GCGCGGAGAA CACGACCCCG
CGCATCGCCC CGGCCACCAT GGAGGCCCTT CTGGGCTGGA GCCTGCGCAT GGTCGAGGTG
ATCGGCCCCG ACGTCGTGGC GGCCCGCGCC GAACTGGACC AGTTCGAGGC TGGCACCCAC
CCCAGCCAGG TGACTCTGGA CGGGTTGTCC TCCCGGCAGC GGCTCCGGGC CTTCGCCGCC
GACGCGAGCC GGCAGGGCCA CGCCCTACCC GGACACCTGA GCCGCGACTC GCAGGAGCAA
CCACGGATCA ACTGGAGCCA CCTCGCCAGG ATGCTGCAGC TGCCGCAACG CCAAGTCCCT
CCCAGCCTGC AGCCCGTCAT CCGCGACAGC GGCCTGCCCA TCGCCGACGG CTCACCCGTG
GGCACCATCA CCGGGCGCAT CGAGGAGAGA CCGTGGCGCG AACGACCCCT GACGACGCAA
GAACTCCCCG GCCTGGTCTC TCACCTGTCC GCGGCCTGCT TCGTAGTCAT CAGCTACCTG
TCGGGCATGC GACCCGGCGA AGTCCTCAAC CTACGCCGCG GCTGCACCCG CCAGGACGAG
ACGACCGGCC AGCTCCTGGT AGCCGGCCGG TGCGGCAAAG GACACGCCCG CACCCCGCGG
CCCACCCTCG GGGACGACCC CTGGGAGCGC ACCTGGGTCG TTGTGCGTCC CGTCCACCAG
GCCATCACCG TGCTCGAGCA GCTGACCGAC GCCCCCTTGC TGTTCCCGTC CAGCCTCCAC
AAGCCGCACG CGATCCGGCC CGCCGACCGG CATGCCCGCC GAAACGGTGA AATCACCCGC
GACCTGGGAA GCTTCATCGC CTGGGTCAAC GCCACCTTCC ACCGACTCGA CGGCCAGCCC
GCGATCCCCC CTGATCCGGC CGGGCGGATC TACCCCATCC GGTTTCGCCG TACGCTGGCC
TACTTCATCG TGCGTCGCCC CCGCGGGCTG ATCGCCGCCG CGTTGCAGTA CGGGCACGTG
TCCACCACCG TCACGCTCAG CTACAGCGGC AAGGCCGACA CCGGGTGGCT CGACGACCTG
GCCGTCGAAC GCCTGGAGGC CCTCCTCGAG CACAACCAGC ACGACCATGC CCTGCTGGCC
CGAGGGGAGC ACGTCAGCGG CCCCGCCGCC ACCGACTATC GCGACCGCGT GGAATCCGCT
CACCGGTTCG CCGGCCGCAC GGTCAACCGC GTCCGCAACG TCGAACGACT CCTGGCACAG
GCCGACCCCA GCATCCACCA CGGCGACGGG ATGACCTGCG TGTACCGGGC CGAGACCGCC
GCCTGCCGCA CTAGCCGGAT CGCTCACGGA CTTCCCGCTC CGGACGGCCC CCTCGAGGCG
GAATGCCAGT CCGGGTGCGT CAACCTGGCC TACACCGACC GCGACATCGC TCGCCTGCGC
GAACGCCTCA ACGTCCTGAA TGCCGCCGCC GACGACCAGA TGACACCAGC CCCACTGCGC
GACCGCGCAC GCGCCCAAGC CGACCAGGTC CGCGCCGTCC TCACACGACA CCAGACAGAG
CCCAAAGAAG GCCGCACATC ATGA
 
Protein sequence
MGPALTSDEG DRGRVGASIL YHRTVRAGTT PSKLSRFEDT VWHLAPAHPD AHAKINAIRW 
DHWPAELVEV FKTVALAFLE HPVPRSVTVT SDGEPMSIGT LVFRLRTLHV FAAWMSQHHL
PSLHEVTDQH LERYRRHVLG LETSNRRKRD LFIAVRTVWD YRAYLPPHCR LDTDNPWDGT
PPSRLAAAPC RPAGAENTTP RIAPATMEAL LGWSLRMVEV IGPDVVAARA ELDQFEAGTH
PSQVTLDGLS SRQRLRAFAA DASRQGHALP GHLSRDSQEQ PRINWSHLAR MLQLPQRQVP
PSLQPVIRDS GLPIADGSPV GTITGRIEER PWRERPLTTQ ELPGLVSHLS AACFVVISYL
SGMRPGEVLN LRRGCTRQDE TTGQLLVAGR CGKGHARTPR PTLGDDPWER TWVVVRPVHQ
AITVLEQLTD APLLFPSSLH KPHAIRPADR HARRNGEITR DLGSFIAWVN ATFHRLDGQP
AIPPDPAGRI YPIRFRRTLA YFIVRRPRGL IAAALQYGHV STTVTLSYSG KADTGWLDDL
AVERLEALLE HNQHDHALLA RGEHVSGPAA TDYRDRVESA HRFAGRTVNR VRNVERLLAQ
ADPSIHHGDG MTCVYRAETA ACRTSRIAHG LPAPDGPLEA ECQSGCVNLA YTDRDIARLR
ERLNVLNAAA DDQMTPAPLR DRARAQADQV RAVLTRHQTE PKEGRTS