Gene Noca_0953 details


Gene Information

Locus tag: Noca_0953
Symbol:
ID: 4597486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1001486
End bp: 1003498
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 73%
IMG OID: 639775556
Product: von Willebrand factor, type A
Protein accession: YP_922163
Protein GI: 119715198
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
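
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


gene_bp, protein_aa = cds_lengths(1001486, 1003498)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the Gene Length (2013 bp) and Protein Length (670 aa) fields of this record.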


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGACC GGACCCGGTT CAAGCGGTAC GACGGCGGAC CGGACCCGCT CGCGCCGCCC 
GTCGACCTGG CCGAGGCCCT GGACGCCATC GGCGAGGACG TGATGGCCGG GTACTCCCCG
GAGCGGGCGA TGCGGGAGTT CCTGCGCCGC GGTGGGCGCG ACCAGTCCGG TCTCGACGAG
CTGGCCCGCC GGGTGGCGGA GCGGCGTCGC GAGCTCACCC AGCGGCACCA CCTCGACGGC
ACCCTGAACG AGGTGCGGGA ACTGCTCGAC CGGGCGTTGC TCGAGGAACG CAAGCAGCTG
GCCCGCGACG CGATGATGGA CGACGCCGAC CGGGCCTTCC GTGAGCTGCG CCTGGAGAAC
CTGCCCTCGT CGACGGCGGC GGCGGTGAGC GAGCTGTCGT CGTACGACTG GCAGAGCCGG
GAGGCGCGCG AGAGCTACGA GAAGATCAAG GACCTGCTCG GCCGCGAGCT GCTCGACCAG
CGCTTCGCCG GCCTGAAGCA GGCACTCGAG AACGCGACGG ATGCGGACCG CGCGGCGGTC
AACGAGATGC TCCAGGACCT CAACGACCTG CTCGACAAGC ACCGGCGCGG CGAGGACACC
CAGGCCGACT TCGACCGGTT CATGGCCGAG CACGGCGACT TCTTCCCCGA GCGGCCCAAG
GACATCGACG AGCTGCTGGA CGCGCTGGCG CAGCGTTCCG CCGCCGCCCA GCGGATGCTG
AACTCGATGT CGCCCGAGCA GCGCCAGGAG CTGATGGAGC TCTCGGCGCA GGCGTTCGGG
TCGCCCGAGC TGATGGATCA GCTCGCGCGC ATGGACGCCA ACCTGCAGGC GCTGCGGCCG
GGGGAGGACT GGGGCGGCTC GGAGCGCTTC GACGGCGAGG AGGGGCTCGG GCTCGGCGAC
GGCACCGGGG TGCTCCAGGA CCTCGCCGAC CTCGACGACC TGGCCGACCA GCTCTCCCAG
TCGTACGGCG GCGCCCGCAT GGACGACCTC GACCTCGACA AGCTCGCCCG CCAGCTCGGC
GACGAGGCCG CGGTCGACGC CCGTACCCTG CAGCGGCTCG AGCAGGTGCT GCGCGACTCC
GGCTACCTCA AGCGCGGCAC CGACGGCCAG CTCCGGCTCT CACCGAAGGC GATGCGCCGG
CTCGGCAAGG CGCTGCTGCG CGACGTCGCC GAGCGGATCT CGGGCCGCCA GGGCCAGCGC
GACCTGCAGC GCGCCGGCGC GGCCGGCGAC CTGTCGGGCG CGACCCGGGA GTGGGCGTTC
GGCGACACCG AGCCCTGGCA CGTGCCGCGC ACCATGCTGA ACGGCGTGCT GCGCCGCGCC
GGCGATCCGT CCGCCTCGCT GCTGGACATC GGCGACGTGG AGGTGCAGGA GACCGAGGCC
CGCACCCAGG CCGCCGTCGC GCTGCTGGTG GACACCTCGT TCTCGATGGC GATGGATGGC
CGCTGGGTCC CGATGAAGCG GACCGCGCTC GCGCTGCACA CGCTGATCCG CTCCCGGTTC
CGCGGCGACC ACCTGCAGCT GATCGGCTTC GGCCGGCACG CCGAGGTGAT GGAGATCGAG
CAGCTGACCG CCCTCGACGC CCGCTGGGAC AAGGGCACCA ACCTGCACCA CGCCCTGCTG
CTCGCCAACC GGCACTTCCG CAAGCACCCG AGCGCCCAGC CGGTGCTGCT CATCGTCACC
GACGGCGAGC CGACCTCCCA CCTCGAGCCC GACGGCGAGG TCTACTTCTC CTACCCGCCG
CACCCGCTCA CCGTGGCGTA CGCCGTCCGC GAGCTCGACG CCGCCGGCCG GCTCGGCGCG
CAGACGACGT TCTTCCGGCT CGGCGACGAC CCGGGCCTGG GCCGGTTCAT CGACCAGATG
GCCCGCCGCG TCGACGGCCG GGTGGTCGCT CCCGAGCTCG ACGACCTCGG CGCCGCGGTC
GTCGGCTCCT ACCTCGGCTC CCGCCGGCCC AGCGCGGGCT ACGGCGGCTA CGGCGACTGG
TTCGGCGGCC GCGGCTTCTG GGTCGGCGAC TAG
 
Protein sequence
MKDRTRFKRY DGGPDPLAPP VDLAEALDAI GEDVMAGYSP ERAMREFLRR GGRDQSGLDE 
LARRVAERRR ELTQRHHLDG TLNEVRELLD RALLEERKQL ARDAMMDDAD RAFRELRLEN
LPSSTAAAVS ELSSYDWQSR EARESYEKIK DLLGRELLDQ RFAGLKQALE NATDADRAAV
NEMLQDLNDL LDKHRRGEDT QADFDRFMAE HGDFFPERPK DIDELLDALA QRSAAAQRML
NSMSPEQRQE LMELSAQAFG SPELMDQLAR MDANLQALRP GEDWGGSERF DGEEGLGLGD
GTGVLQDLAD LDDLADQLSQ SYGGARMDDL DLDKLARQLG DEAAVDARTL QRLEQVLRDS
GYLKRGTDGQ LRLSPKAMRR LGKALLRDVA ERISGRQGQR DLQRAGAAGD LSGATREWAF
GDTEPWHVPR TMLNGVLRRA GDPSASLLDI GDVEVQETEA RTQAAVALLV DTSFSMAMDG
RWVPMKRTAL ALHTLIRSRF RGDHLQLIGF GRHAEVMEIE QLTALDARWD KGTNLHHALL
LANRHFRKHP SAQPVLLIVT DGEPTSHLEP DGEVYFSYPP HPLTVAYAVR ELDAAGRLGA
QTTFFRLGDD PGLGRFIDQM ARRVDGRVVA PELDDLGAAV VGSYLGSRRP SAGYGGYGDW
FGGRGFWVGD
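
The sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The following is a minimal sketch of how the GC fraction could be computed and how the gene sequence could be translated into the protein sequence. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares; the first codon (GTG here) is assumed to be an initiator and is therefore reported as M.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; translation table 11 uses
# the same amino acid assignments and differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(raw: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(raw)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(raw: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumes the sequence starts at an initiator codon, which is reported
    as M. Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(raw)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    if protein:
        protein[0] = "M"       # initiator codon is read as methionine
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "GTGAAGGACC GGACCCGGTT CAAGCGGTAC GACGGCGGAC CGGACCCGCT CGCGCCGCCC"
print(translate_cds(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first line this way gives MKDRTRFKRYDGGPDPLAPP, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow the GC content and Protein Length fields to be checked against the record.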