Gene Noca_3074 details

Gene Information

Locus tag: Noca_3074
Symbol:
ID: 4600191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3274022
End bp: 3275296
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 75%
IMG OID: 639777680
Product: hypothetical protein
Protein accession: YP_924263
Protein GI: 119717298
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
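
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3274022
end_bp = 3275296

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1275
print(protein_length)  # 424
```

Both values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields listed in the record.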


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.144067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGAGG CGCTGGCCGG GCTGACGGTC CGTGGCCGGG CGTTCCTGGC GGCCGGCATC 
ACCGCCGTCG TGTGCGCGAT CGCCCTCGAC CAGACCGCCC TGACCCGGAT CGGCGTGCTC
GTCCTGGTCC TCCCGCTGCT CACGGCATGG GTGGTGGGGC GCAACCGGTA CCGGCTGGCC
CTGGTCCGCA CGGTCACTCC ACAGCTGGTC GCCGCGGGCC AGCCGGCCCG GGTGTCGCTG
GCCCTGAGCA ACGAGGGCCG TACGCCGAAC GGCGTGCTGC TGCTGGAGGA CCAGGTGCCC
TACGTGCTCG GCACCCGCCC GCGCTTCGTC GTCGAGGGCA TCGGCTCCGG CTGGCGGCAG
ACCGTCAGCT ACCAGGTCCG CTCCGACGTG CGCGGCCGCT TCGAGATCGG GCCGATGTCG
GTGCGGGTCA CCGACCCCTT CGGGCTGGTC GAGCTGGGCC GCACGTTCCG CACCACCGTG
CCGCTGACCG TGACGCCACG CACGGTGCCA CTCCCCCAGA TTCCGCTCGG TGGCGCCTGG
ACCGGGTCCG GTGACAACCG GCCCCGCGCC TTCGCCACCG GCAGCGCCGA GGACGTCACT
GTCCGGGAGT ACCGCCAGGG CGACGACCTG CGCCGGGTGC ACTGGCGCAG CTCGGCCCGG
CTCGGCGAGC TGATGGTGCG GCGCGAGGAG CAGCCGTGGC AGTCCCGGGC GACGCTGGTC
CTCGACAACC GGGTGCTGGC CCACCGGGGA CAGGGCATCG CATCCTCGCT GGAGGCGGCC
GTCTCCGCCG CCGCGTCGAT CGCGGTGCAC CTGAGCCACC GCGGCTTCGC CGTACGCCTG
GTCACCGCGC TGGGCGAGGA CCCCAGCAGC GCCTGGCACC TGCGCGACGC CGACCTCAAC
ACCGGGCCGC TGATGGAGGC CCTCGCGGTG GTGCAGGCGA CCCACCAGTC CCGCCTCGAC
ACTGCGTGGC TGGCCGAGGG TGCCCACGGC GGGCTGACGG TCGCCGTCTT CGGTGGCATC
CTGCCCGCCG ACCTCCCGGT GCTGCGCCGG ATGCAGCACC AGGCCGGGTC GGCACTCGCG
ATCGCGCTCG ACGTCGACGC CTGGACCGGC GCGCCGGCCG GCGTGGGCGC AACCCCGGCC
CTCGGCCAGC AGGGCTGGCG AGCGGTGCCG CTCGGCCCGC GCGACCGGCT CGAGTCCGTG
TGGCAGGAGC TCGGGCACAC GAGCGCGCAG CGCTCCCGCG TCGTCAGCCG GAGTGCCTCG
GAGGCGACCG TGTGA
 
Protein sequence
MREALAGLTV RGRAFLAAGI TAVVCAIALD QTALTRIGVL VLVLPLLTAW VVGRNRYRLA 
LVRTVTPQLV AAGQPARVSL ALSNEGRTPN GVLLLEDQVP YVLGTRPRFV VEGIGSGWRQ
TVSYQVRSDV RGRFEIGPMS VRVTDPFGLV ELGRTFRTTV PLTVTPRTVP LPQIPLGGAW
TGSGDNRPRA FATGSAEDVT VREYRQGDDL RRVHWRSSAR LGELMVRREE QPWQSRATLV
LDNRVLAHRG QGIASSLEAA VSAAASIAVH LSHRGFAVRL VTALGEDPSS AWHLRDADLN
TGPLMEALAV VQATHQSRLD TAWLAEGAHG GLTVAVFGGI LPADLPVLRR MQHQAGSALA
IALDVDAWTG APAGVGATPA LGQQGWRAVP LGPRDRLESV WQELGHTSAQ RSRVVSRSAS
EATV
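
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch using only the Python standard library, not the method used to generate this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, no special handling of the first codon is included here.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGAGAGAGG CGCTGGCCGG GCTGACGGTC"
last_line = "GAGGCGACCG TGTGA"

print(translate(first_blocks))  # MREALAGLTV
print(translate(last_line))     # EATV
```

The two outputs match the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked in the same way.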