Gene Noca_3704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3704 
Symbol 
ID4597621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3931154 
End bp3932254 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content75% 
IMG OID639778312 
Producthypothetical protein 
Protein accessionYP_924891 
Protein GI119717926 
COG category[S] Function unknown 
COG ID[COG4129] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0978741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGCGG GGCCGCTGGA CCGGATGTGG GACCGGAGCC GGACCTCGGT CCGGGCTCGG 
ATCCGCCGGC TGCAGGCCAA GTCCTTCCAG ATCGCGCAGT GTGCGCTCGC CGCCGCGGTC
GCCTGGTTGA TCGCGGCCGA CGTCCTGGGC CATCCCACCC CGTTCTTCGC CCCGGTGGCC
GCCGTGGTCG CCCTCGGCAC CTCCTACGGC CAGCGGCTGC GCCGGGTCGC CGAGGTCACC
ATCGGTGTGG CCGTCGGCGT CTTCCTCGCC GACCTGCTGG TGGTGTGGCT GGGCTCGGGC
TGGTGGCAGA TCGGGCTGAT CGTCGCGCTC GCGATGGCGG CGGCCTTCCT GCTCGACGGC
GGCCAGCTGT TCGTGACCCA GGCGGCGGTG CAGTCCATCG TGGTCGCGTC GCTCGTCCCG
GACCCGGGGG CGGCGTTCAC CCGGTGGACC GACGCGGTCG TGGGTGGCGC GGTCGCACTG
GTCGCGGCCA CCGCGGTGCC GGCCGCGCCG CTACGCCGGC CGCGCGAGCA GGCCGCGGTC
GTGATGCGCA AGATCGCAGC CCTGCTGCGG GCGGCCGGTG AGGTGATGGA CGACGGGGAG
GCGGCCCGGG CGCTCGACCT GCTCGCCGAC GCCCGCTCGA CCGACTACCT GATCCGCGAG
CTCAAGGACG CCGCCGACGA AGGCCTCTCC GTGGTCGAGT CCTCGCCGTT CCGGATCCGC
CACAAGGGCG ACCTGCGCAA GATGGTCGAC CTGGTCGACC CGCTCGACCG GGCGCTGCGC
AGCACCCGCG TCCTGGTCCG GCACACCGCG GTCGCGGCGT ACCGCCGGCG CCCGGTGCCG
TCGTCCTACT CGCTGCTCGC CCTCGACCTC GCCGACGCGG CCGACCGGGT TGCCGACGAG
CTGGCCGCGA ACCGGATGGC GGCCGCTGCC CGAGGCGTCC TGCTCGTGGT GGGCGAGGCC
ACCGGGCAGG TGGAGCGCAC CGACCTGCTC ACCGCCGAGG CGATCCTGGC CCAGCTGCGG
GCGGTCGTCG CGGACCTCCT GATGGTCACC GGGCTCGACC AGATGGAGTC GACCGAGGCC
CTGCCCCCGC CACCGCGATG A
 
Protein sequence
MEAGPLDRMW DRSRTSVRAR IRRLQAKSFQ IAQCALAAAV AWLIAADVLG HPTPFFAPVA 
AVVALGTSYG QRLRRVAEVT IGVAVGVFLA DLLVVWLGSG WWQIGLIVAL AMAAAFLLDG
GQLFVTQAAV QSIVVASLVP DPGAAFTRWT DAVVGGAVAL VAATAVPAAP LRRPREQAAV
VMRKIAALLR AAGEVMDDGE AARALDLLAD ARSTDYLIRE LKDAADEGLS VVESSPFRIR
HKGDLRKMVD LVDPLDRALR STRVLVRHTA VAAYRRRPVP SSYSLLALDL ADAADRVADE
LAANRMAAAA RGVLLVVGEA TGQVERTDLL TAEAILAQLR AVVADLLMVT GLDQMESTEA
LPPPPR