Gene Noca_1997 details


Gene Information

Locus tag: Noca_1997
Symbol:
ID: 4598313
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2136688
End bp: 2137848
Gene length: 1161 bp
Protein length: 386 aa
Translation table: 11
GC content: 74%
IMG OID: 639776601
Product: UspA domain-containing protein
Protein accession: YP_923194
Protein GI: 119716229
COG category:
COG ID:
TIGRFAM ID:
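
The start, end, gene length, and protein length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, which is an assumption here rather than something the record states. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon,
    with the terminal stop codon not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2136688, 2137848)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results match the "Gene length" and "Protein length" fields of the record.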


Plasmid Coverage information

Number of covering plasmid clones: 13
Plasmid unclonability p-value: 0.297843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiking: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAGCG AGACGATCCC GGGCGGGACC ATCGTCGTGG GCGTCGACCA GACCCCCGCC 
GCGAGGGACG CGCTGGCCTG GGCGATCCAG GAGGCCGTGC ACGAGGAGAC GCCGCTGACG
CTGGTCCACG GCGTCGGTGC GGCGCTGTCC GCGTGGGTCG ACTTCTCGAC GATCGACTTC
CGCGACGTGA TCGACGCGAC CACCGCGACC GGGCGGGAGG TCCTCGACGC CGCGCAGGCC
CAGGTCGACC GGGTCGCGCC CGCGGTCGAG GTGCACCACG TGCTCCGCCT GACCGACCCG
CACCAGGCGC TGCTCGACCT CTCCGAGGAC GCGGCGATGC TGGTCGTGGG GTCCCGGACC
GCGCCGCACG AGCGCGAGTC GCTGATGTCC TCGGCGCTCG GCTTCGTCCG CGACCGTGCC
GGCTGCCCGG TGGTGGTCCC GCGGCTGCGC CGCAGCCCCG GCGTCGGCGT GGTCGTGCTC
TGCGACGGCT CGCCGGAGTC CCAGTCGATC CTCGCCTACG CGTGGGGCCA GGCCGACCGC
CGCGGCCTCC CGCTGACCGT CGTGCACTGC CTGCCCGACC CGCCGGCCGA CCTGCACCCC
CGCGACATCA GCCTGATGGA CGCAACGGCT CGCATGGAGG CCAGCCTCGG GCCCTCGCTG
CTCGGCTTCG AGCGGCTGCT GCTCAGCGAC CTGGTCCGTG ACATGCGGTC GCGCTGGCCG
GGCGTGGACG TCCGCCTGGT CGTCGAGGAC GACGCGATCG ACAGGTGGCT CGAGCGCGCC
CGGCAGCAGG CCGACCTGCT CGTGGTCGGC GCCAGGCACG CCCGCCGGCT GTCCGAGCTG
GTCATCGGCA GCTCCACGCC GGAGGAGGTG GAGTGCGTCA CCGTCGTCCT GCCGTTGGAG
GACCGGCTCG ATCCCGACGC GGACGCGGCG CGGGTGACCA TCCAGCGGCT CCATGGCTGT
GCCCAGCGGC TGGTCCGGCT CAACATCCCC CTCGACCAAG CGGTGGCCGA GATCCGCTCG
GTCACCATGG ACACCGACCT GCTCGCCGAG GCCGCCCTCA CCGCGCTGCG CGGCTGGGGC
GCCACGACCG CCAAGAGCTG GCAGACCCGC GAGGTCACCG AGCTGCTGGT ACGGGCCGGA
GCCCGCCGCG TCTGGCCCTG A
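
A GC content figure such as the one in the Gene Information section can be recomputed from a blocked sequence like the one above. The sketch below is one way to do it in Python, assuming GC content means the fraction of G and C bases over all bases; the helper names are hypothetical. It is demonstrated on the first two 10-base blocks only, not on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGGAGAGCG AGACGATCCC"
print(f"{gc_fraction(first_blocks):.0%}")  # 60%
```

Passing the complete gene sequence to gc_fraction gives the value for the whole gene.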
 
Protein sequence
MESETIPGGT IVVGVDQTPA ARDALAWAIQ EAVHEETPLT LVHGVGAALS AWVDFSTIDF 
RDVIDATTAT GREVLDAAQA QVDRVAPAVE VHHVLRLTDP HQALLDLSED AAMLVVGSRT
APHERESLMS SALGFVRDRA GCPVVVPRLR RSPGVGVVVL CDGSPESQSI LAYAWGQADR
RGLPLTVVHC LPDPPADLHP RDISLMDATA RMEASLGPSL LGFERLLLSD LVRDMRSRWP
GVDVRLVVED DAIDRWLERA RQQADLLVVG ARHARRLSEL VIGSSTPEEV ECVTVVLPLE
DRLDPDADAA RVTIQRLHGC AQRLVRLNIP LDQAVAEIRS VTMDTDLLAE AALTALRGWG
ATTAKSWQTR EVTELLVRAG ARRVWP
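
The protein sequence can be checked against the gene sequence by translating codon by codon. The following Python sketch uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The function name and the compact table layout are illustrative choices, not part of the record.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased; this gene
    starts with ATG, so that does not matter here.
    """
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGGAGAGCG AGACGATCCC GGGCGGGACC"))  # MESETIPGGT
print(translate("GCCCGCCGCG TCTGGCCCTG A"))           # ARRVWP
```

The two outputs agree with the first and last residues of the protein sequence listed above.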