Gene Arth_2373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2373 
Symbol 
ID4444987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2660902 
End bp2661978 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID639690181 
Producthypothetical protein 
Protein accessionYP_831852 
Protein GI116670919 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCGT GGCGTCCGTC CAGACTCCCT GAGCGCGCCC GCATGGCCAG CCTCCTGACC 
CGGGTGAAAT CTAAAATGTC GATCTTCGCC CACCGCAAGG CACGCGGGAT GCTCGACGGC
GAATACGGTT CCGTTTTCAA GGGGCGCAGC CTGGACTTTG ACGACCTCCG TGCCTACATT
CCCGGAGACG AAGTCCGCGA CATCGACTGG AAAGCCTCTG CCCGGCACGG ATCCCCGCTC
ATCAAACGCT ATGTTGCAGT GCGGCGGCAG ACAGTGCTGC TGGTCACGGA TACCGGACGC
AACATGGCTG CTTCCTCGCT CGGCGGCGAG GAGAAGAAGG ACATTGCCGT GATGGCCCTG
GGCGTGGTGG GCTACCTTGC CCACCGTCAC GGCGACGTAG TGGGGCTCGT GTGCGGCGAC
GGGACGTCGA CCCGGTCGCT GCCCGCGAAA GCCGGCGAGG CCCACCTGGA AAGGCTTCTT
CGCGAAGTCG ACGGGGCCAC GGCGCTGGCC TCGCCCCGAA GCAACATCAG CGAGCAGCTC
TCCTATGTGG CACGCAACTT CGGCCAGCGC ATGCTGCTCT TCGTTGTGGC CGACGAGCTG
GTGCCGGATG CCGGGATGGA GCGGCTGCTG CGGCGGCTGC GCGCGCAGCA CGAAGTCCTC
TGGCTGACCG TCCGCGACGC GCAGTTGGCC GGACCCGCCG CCGGACCGAA CCCCGCCGGA
CCCGCCGCCG GACCGAACCC CGCCGGACCC GCCGCCGGAC CGAACCCCGC CGGACCCGCC
GCCGGACCGA ACCCCGCCGG ACCCGCCGCC GGACCGAACC CCGCCGGACC CGCCGCCGGA
CCGAACCCCG CCGAACCCGT CGACCGCTAC GACGTTGCGG ATGCCGGCTT CCTTCCCGGA
CGCCTTGCGG CGTCTGATGC CATCATCCGG GCCTATGCCG CGGCGCAGGA GCAGCGCGAT
GCCGCCCGGG AGGCTGTGCT GCGGCGGATG GGCATTGCCC ACGTCGATGC GGGCAGCAGC
CATGATGTGA TGCCTGCGGT GTTCACCCTG CTGGAACGGC ACCGCCGTGG GAAATGA
 
Protein sequence
MPSWRPSRLP ERARMASLLT RVKSKMSIFA HRKARGMLDG EYGSVFKGRS LDFDDLRAYI 
PGDEVRDIDW KASARHGSPL IKRYVAVRRQ TVLLVTDTGR NMAASSLGGE EKKDIAVMAL
GVVGYLAHRH GDVVGLVCGD GTSTRSLPAK AGEAHLERLL REVDGATALA SPRSNISEQL
SYVARNFGQR MLLFVVADEL VPDAGMERLL RRLRAQHEVL WLTVRDAQLA GPAAGPNPAG
PAAGPNPAGP AAGPNPAGPA AGPNPAGPAA GPNPAGPAAG PNPAEPVDRY DVADAGFLPG
RLAASDAIIR AYAAAQEQRD AAREAVLRRM GIAHVDAGSS HDVMPAVFTL LERHRRGK