Gene Noca_4743 details


Gene Information

Locus tag: Noca_4743
Symbol:
ID: 4595467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 42626
End bp: 44032
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 71%
IMG OID: 639772532
Product: hypothetical protein
Protein accession: YP_919192
Protein GI: 119714050
COG category:
COG ID:
TIGRFAM ID:
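
The start, end, gene length, and protein length listed above can be cross-checked against each other. The sketch below does this under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the protein length excludes a single terminal stop codon (the gene sequence below does end in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the record for Noca_4743.
length_bp = gene_length(42626, 44032)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1407 468
```

Both results match the Gene Length and Protein Length fields, which suggests the assumptions hold for this record.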


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCT CCACCTATGG CGAGCACGCC GAGAACGCGG CCCAGGCGCT CAACGCCCTG 
ATCCGCGACG ACGTGATCCC GGCCGACGAG GCCGCGGTCG ACCAGCTGCT GCACTGCCGC
GAGGCCGTGG TCGACGCGCT GCGACAGCGG CTCTACGACG TCGGCCAAGA CAGCTGGTAC
CCGCCGCCGG ACCACCTTCC ATCTCGTACG CCGAAACCGA CACTGGCCGG GCTGGACGAG
AAGCTCGCGA CGCTGGTCGA CAACATCGCG TTCGCGCTCC CCACGCTCCC GCTCGACGAG
CGCCGCTCCC CCGCGGACTA CCTCACGCCC GCCTCGACCG ATCCGACGGT CGAGGCGTGG
CGCACCGCGG CCATCGAGCT GCTGGCGGCC TCCCACGCGC TGTCGGCGGC CGCCGAGCAG
CCTTGGCGCA CCGATCCGGG CACCGGCTGG TGGGTGATGC GCGACGTGTC GGTGGCCCTG
GAGGCGGTCC TGGTGCTCGA CTCCCGCCTC GAGGAGGTCG GCCTGCTGGC TGAGCATCAG
CGACCGGACT ACGCGATGGG GCTGGACGAG AAGCGCATGG TGCTCTCCCA GACAGCCCGG
GTCGCGACGT GGCACGCCAC GAGCGCGAGC CCCGACGAGG CCACCCCGCG TCTCCGGCAG
GCGGCGCCGT CGACGGTGGT CGAGCCGGTG TCGCTGGTCT CGACACCCGA CGACCTGGCG
GCCGCACAGC TCCGGCTTGC CCGGTTCCTG CGGCCACTCC ACGCCTCCGA CGCGTTCTAC
GCCCATGAGC CGGAGATCAC CGCCGACTCC GCACGGCAGG TCACGGCCAG CCAGCTCTAC
CTGTGCCGTG CGTTCGCGAA GGCCGCCGGT CTCTCGCCCA AGACCAGCAT GTTCGCGACG
TTCTTCGAGG AGCGCGCCGA GGTGCTGGAG TCGTTGCAGC CCCAGATCAG CCACCTGGCC
GACGTCTCAG AGGAAAGGGA CCCGAACATG CGCCGCTTCT GGCAGCAAGG CGAGCTCACG
TCCGCGGTTG CCCGGATGGA AGACCAAGGC GTGCCGATCC GGTTGCAGCC CACTCAGATG
GTCGAGTTGG CGAAGGCGAC CCACGAGGTC ACTTACAACC TGGGCAAGGC ACTGCGCCGC
GAACTGCTGC GCGGCAACAG CAACCTGCTC GACGCCCACC CACGCCACCG CGACGGGCCG
GTCCGCGTCG GCCGGCGCTC TCGGCTGGAG ACCACGCTGA CCGATCTGGT CAACATGCCC
GCTCCCAACG AGCCGGTGGC CCGGTTCAGC AGCCCGCTTC AACGGGCCGC ACTCCAACAA
GCACTGAACC TGACCCCCTC GTCGTCGCGA ACCCCGACGC CGTTCCCCGC GGCCCGCGCG
GCGACGTACG ACGCCCCAGC CTTCTGA
 
Protein sequence
MTVSTYGEHA ENAAQALNAL IRDDVIPADE AAVDQLLHCR EAVVDALRQR LYDVGQDSWY 
PPPDHLPSRT PKPTLAGLDE KLATLVDNIA FALPTLPLDE RRSPADYLTP ASTDPTVEAW
RTAAIELLAA SHALSAAAEQ PWRTDPGTGW WVMRDVSVAL EAVLVLDSRL EEVGLLAEHQ
RPDYAMGLDE KRMVLSQTAR VATWHATSAS PDEATPRLRQ AAPSTVVEPV SLVSTPDDLA
AAQLRLARFL RPLHASDAFY AHEPEITADS ARQVTASQLY LCRAFAKAAG LSPKTSMFAT
FFEERAEVLE SLQPQISHLA DVSEERDPNM RRFWQQGELT SAVARMEDQG VPIRLQPTQM
VELAKATHEV TYNLGKALRR ELLRGNSNLL DAHPRHRDGP VRVGRRSRLE TTLTDLVNMP
APNEPVARFS SPLQRAALQQ ALNLTPSSSR TPTPFPAARA ATYDAPAF