Gene Noca_3937 details

Gene Information

Locus tag: Noca_3937
Symbol:
ID: 4598072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4144769
End bp: 4146388
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 69%
IMG OID: 639778542
Product: AMP-binding domain protein
Protein accession: YP_925121
Protein GI: 119718156
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
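
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4144769
end_bp = 4146388
gene_length_bp = 1620
protein_length_aa = 539


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length (1620 bp) and protein length (539 aa) are consistent with the listed coordinates.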


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104158
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCCT ACGCGAAGGG CGAGCTCGAG CCGCCCCTCC TCGAGGAGAC CATCGGCGCG 
AGCTTCGAGC GGACGGTGAC CGCGTACGCC GACCGCGAGG CGCTGGTCGA GGTGGCGAGC
GGCCGGCGCT GGACCTGGGC CGAGCTGGAC CGCGACGTCG ACGACCTGGC GCGGGGGCTG
GTGGCCGCCG GGATCGGCAA GGGCGACCGG GTCGGGATCT GGGCGCCCAA CTGCGCGGAG
TGGACGGTCG TCCAGTACGC GACCGCCAAG CTCGGCATCA TCCTGGTCAA CGTCAACCCG
GCGTACCGCA CGCACGAGTT CTCCTACGCG GTCAACCAGA GCGGCCTGCG GCTGCTGATC
AGCGCGTCGA CGTTCAAGAC CAGTGACTAC CGCGCGATGG TCGAGGAGAC CGCGGCGCAG
ACCCCGACCC TCGAACGGGT CGTCTACCTC GACACCGACG ACTGGGCGCA GCTCGTCGAC
GCCGGCCGGA CGCTGCCCGA GGGCGTCGTC GCGGACCGGC TGGCGCAGAC CGCCCCCGAC
GAGCCGATCA ACATCCAGTA CACGTCGGGC ACGACCGGCT ACCCCAAGGG CGCGACCCTG
AGCCACCGCA ACATCCTCAA CAACGGCTAC TTCACCACCG AGCTGATCCA CCTCGGCCCC
GAGGACCGGC TGTGCATCCC GGTGCCCTTC TACCACTGCT TCGGGATGGT GATGGGCAAC
CTCGGGTGCA CCAGCCACGG CACCACGATG GTGATCCCCG CGCCGGGCTT CGACCCCGAG
ATCACCCTGC GCACGATCGC CGCGGAGCGC TGCACCGGCG TGTACGGCGT GCCCACGATG
TTCATCGCGA TGCAGAACCA CCCGACCTTC GCCGAGCACG ACCTCTCCAG CCTGCGCACC
GGGATCATGG CCGGCTCGAT CTGCCCGGTC GAGGTGATGA AGCGCTGCGT CGATGACATG
CACATGGCCG AGGTCGCGAT CGCCTACGGC ATGACCGAGA CCAGCCCGGT GTCCTGCCAG
ACGCGTGCCG ACGACGACCT GGAGCGGCGT ACCGCCACCA TCGGGCGGGT GCACCCGTAC
GTCGAGATCA AGATCGTCGA CCCGGTGAGC GGCGAGACCG TCGAGCGGGG GCGAACCGGT
GAGTTCTGCA CCCGCGGCTA CTCGGTGATG CTCGGCTACT GGGACGATCC CGAGAAGACC
GCCGAGGCGG TCGATGCCGA CGGCTGGATG CACACCGGCG ACCTCGCCGA GATGCGCGAG
GACGGCTATT GCAACATCGT CGGACGGATC ACGGACATGG TGATCCGGGG CGGGGAGAAC
ATCTACCCGC GTGAGATCGA GGAGTTCCTC TACCAGCACC CCGACATCGA GGACGTGCAG
GTGATCGGCG TCCCGGACGA GCGGTACGGC GAGGAGCTGT GCGCCTGGGT GCGGATGCGT
GCCGGGGCCG AGCCGCTCGA CGCGGACGCC GTGCGCGCGT TCGCCACCGG ACGGCTCTCG
CACTACAAGA TCCCCCGCTA CGTCCTGGTG GTGGACGAGT TCCCGATGAC GGTGACCGGC
AAGATCCGCA AGGTGCAGAT GCGTGAGGAG AGCGCGAAGC GACTCGGCCT CCGTGCGTGA
 
Protein sequence
MEAYAKGELE PPLLEETIGA SFERTVTAYA DREALVEVAS GRRWTWAELD RDVDDLARGL 
VAAGIGKGDR VGIWAPNCAE WTVVQYATAK LGIILVNVNP AYRTHEFSYA VNQSGLRLLI
SASTFKTSDY RAMVEETAAQ TPTLERVVYL DTDDWAQLVD AGRTLPEGVV ADRLAQTAPD
EPINIQYTSG TTGYPKGATL SHRNILNNGY FTTELIHLGP EDRLCIPVPF YHCFGMVMGN
LGCTSHGTTM VIPAPGFDPE ITLRTIAAER CTGVYGVPTM FIAMQNHPTF AEHDLSSLRT
GIMAGSICPV EVMKRCVDDM HMAEVAIAYG MTETSPVSCQ TRADDDLERR TATIGRVHPY
VEIKIVDPVS GETVERGRTG EFCTRGYSVM LGYWDDPEKT AEAVDADGWM HTGDLAEMRE
DGYCNIVGRI TDMVIRGGEN IYPREIEEFL YQHPDIEDVQ VIGVPDERYG EELCAWVRMR
AGAEPLDADA VRAFATGRLS HYKIPRYVLV VDEFPMTVTG KIRKVQMREE SAKRLGLRA
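
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked text might be cleaned and how a GC fraction could be computed for comparison with the GC content field. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGAGGCCT ACGCGAAGGG CGAGCTCGAG CCGCCCCTCC TCGAGGAGAC CATCGGCGCG"
seq = clean_sequence(first_line)

# Prints the cleaned length and the GC fraction of this first line only.
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence through the same two helpers would give a value that can be compared with the GC content listed under Gene Information.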