Gene Arth_3372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3372 
Symbol 
ID4444101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3789243 
End bp3790544 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID639691195 
Producttype II secretion system protein E 
Protein accessionYP_832847 
Protein GI116671914 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGC TTCCACAGAT GCCGGGTGCG TCCCGGTTTC CCGGGCCGGC CGGCGGCCTT 
AGGCGGCGGC AGAACAGTGC ACTGGACGCC GGGCTGCTCG AATCCGTCCG CGAATCAGTG
ATGGCCGATT CCGGTCCGGT GACCCCCTCC CGGGTGGCCG CGGCCGTTCA GGCCACGGGA
AGGCTCCTGG GCACGGCGGG GTCGCTGGCC GCCGTCGAAC GGATCAGCGC AGAGCTCAAC
GGTCTGGGAC CGCTGCAGGT GCTGACCAGG GATCCGTCCG TAACGGACAT CTTCGTCAAC
GCCCCGGACT CCGTCTGGCT GGACCGCGGA AACGGCCTGG AGCAGGCGGC GGTGTCGTTC
TCCTCCGAAA GCGAGGTACG TTCGCTGGCG GCCCGCCTCG TGGCGGCAGG CGGGCGGCGT
CTGGACGACG GATCCCCGTG CGTCGATGTC AGGCTTGAGG CCGGATACCG GGTCCACGCA
GTCCTGCCGC CGATCTCGAC AGCCGGGACG CTGTTGAGCG TCAGAATCCG CCGTCACGAG
GTGTTCACGC TGGACGAGCT CCGGGACGGC GGCATGTTTG GTTCTTTGGT CCAGGACGTA
CTGGAACGCG TGGTTTCCCG GCGTCTGAGC TTCCTGGTCA GCGGTGCCAC CGGGTCAGGG
AAGACCACCC TCCTCTCAAC ACTCCTGGGG CTGAGCGAGC CTGGCGAACG GCTCGTCCTG
ATCGAGGATG CTTCCGAACT GAACCCCGTC CATCCGCACG TGGTGTCACT TGAGTCGAGG
CACGGAAACC TTGAAGGCGG CGGTGCGGTG GACCTCGCCG AACTGGTACG GCAGGCCCTC
CGAATGAGAC CTGACCGCCT GGTGGTGGGG GAATGCCGCG GAGCCGAGGT CCGCGAACTG
CTGACGGCTA TGAATACCGG ACACACCGGG GGCGGCGGAA CGATCCACGC GAACACGGCT
GCCGCCGTGC CTGCCCGCCT CACGGCGCTC GGCGCCCTTG CCGGAATGGG TCAGGACGCC
ATGCGGCTGC AGGTTGCCAG CGCTTTGGAC GTTGTGGTCC ACGTGGAGCG TTCCCGCGGC
ATCCGTCAGG TGGCCTGCAT CGGGTTGGTT GAAGACGGCC CGCTCGGACT GGAAGTCTCG
GCGGCCGTGG CTGTGCAGGC GGGTACCGTC ACCCTGGGAC CCTCCTGGCC GAGGCTTGCG
CGAAGACTGG GCATTGATGC TTCCGGCGCC GCAAACCCCG GCGCCGCGGA CCCCGGCCAG
GCAGCCACCG GTGCCGGACC CCTGAGTCCA GTGCACAGGT GA
 
Protein sequence
MSTLPQMPGA SRFPGPAGGL RRRQNSALDA GLLESVRESV MADSGPVTPS RVAAAVQATG 
RLLGTAGSLA AVERISAELN GLGPLQVLTR DPSVTDIFVN APDSVWLDRG NGLEQAAVSF
SSESEVRSLA ARLVAAGGRR LDDGSPCVDV RLEAGYRVHA VLPPISTAGT LLSVRIRRHE
VFTLDELRDG GMFGSLVQDV LERVVSRRLS FLVSGATGSG KTTLLSTLLG LSEPGERLVL
IEDASELNPV HPHVVSLESR HGNLEGGGAV DLAELVRQAL RMRPDRLVVG ECRGAEVREL
LTAMNTGHTG GGGTIHANTA AAVPARLTAL GALAGMGQDA MRLQVASALD VVVHVERSRG
IRQVACIGLV EDGPLGLEVS AAVAVQAGTV TLGPSWPRLA RRLGIDASGA ANPGAADPGQ
AATGAGPLSP VHR