Gene Arth_2933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2933 
Symbol 
ID4444455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3301908 
End bp3303395 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID639690756 
Producttype II secretion system protein E 
Protein accessionYP_832412 
Protein GI116671479 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.242377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGCC CGGAACTGCC CACCCGGCAG CCGGAACCGT GGACACCGCC AGCACCGTCA 
AAGCGGACGA CGACGGCGGC ACCGGCTGCC GCCGTCGTCG TTCCTCCAGC GGTCACAGAC
CACGCGGATG AGGTGAAACC GAAAGTCCAG CAGCCCGTGG ACGCGTTCGC CGCGTTGAAG
GGAAGGGCGG CCACGGCGCT GTTCGAACGC ATGGGTTCGC GTTTCAACGA CTCTGCCATC
ACCGAACTGG AGCTCCGGAC GTCCGCCCGG GAAGAGCTCA CCCGCATCAT CGATGCCGAG
CAGGTGCCGC TCTCCTCGGA GGAACGCACG CGCCTGGTGC AGGACGTTGC CGACGACGTG
CTGGGCTACG GCCCCCTGCA GCGGCTGCTG GACGACCCCG CCGTCACCGA AATCATGGTC
AACCGGATGG ACCAGATCTA CGTGGAGCGC AAGGGCCACC TCACCCTGGC CGACTCCCGC
TTCAGCTCGG AAGAACACCT GCGGAAGGTC ATCGAGCGCA TCGTGTCCAA GGTGGGCCGC
CGTATTGACG AGTCCTCTCC CCTGGTGGAT GCCCGGCTGG AGGACGGCTC CCGCGTCAAC
GCAGTGATCC CGCCACTGGC CGTGGGCGGA TCGTCACTGA CCATCCGGAA GTTCAGCAAG
GTTCCCCTCA CCGTCCGGAA CCTGATCGAC TTCGGAACGC TGACTCCGGA AATGGCGGAG
CTCCTGAACG CGTGCGTCAA AGCCAAGCTC AACATCATCG TCTCGGGCGG AACGGGAACC
GGTAAAACCA CCCTGCTCAA CGTGCTTTCG TCCTTCCTGC CCTCGGACGA ACGGATCGTC
ACCATCGAGG ACGCCGTGGA ACTGCAGATC CAGCAGGCGC ACGTGGTCCG GCTGGAGAGC
CGTCCGCCCA ACACCGAGGG CAAGGGCGAG GTGACCATCC GTGAACTGCT GCGCAACTCG
CTGCGTATGC GCCCCGACCG CATCGTGGTG GGCGAGGTCC GCGGCGGCGA GTCACTGGAC
ATGCTCCAGG CGATGAACAC CGGCCACGAC GGTTCTCTCT CCACGGTGCA CTCCAATTCC
CCGCGTGACG CCGTGGCGCG CCTTGAAACC CTGGTGCTCA TGGCCGGGAT GGACCTGCCG
CTGCGGGCCA TCCGGGAACA GATAGCCTCG GCGGTGAACC TGATCGTCCA GATTTCACGG
CTGCGGGACG GCAGCCGCCG CATCACCCAC GTGACCGAGG TGCAGGGCAT GGAAGGCGAC
ATCGTCACCC TGCAGGACGC CTTCGTTTTC GACTACTCGG CAGGCGTCGA CGCCCACGGC
CGCTTCCTCG GACGGCCGGT TGCCACCGGG ATCCGCCCGC GTTTCATCGA CCGCTTCGAG
GACCTCGGAA TCCACGTTTC CCCTGCCGTT TTCGCCACGC CCGGCAGCCC GGCCGGTCCT
GCGGGCCACT CCGGCCAGGC GGGCCACTCC GGCCAGGGAA TCGCGTAG
 
Protein sequence
MDRPELPTRQ PEPWTPPAPS KRTTTAAPAA AVVVPPAVTD HADEVKPKVQ QPVDAFAALK 
GRAATALFER MGSRFNDSAI TELELRTSAR EELTRIIDAE QVPLSSEERT RLVQDVADDV
LGYGPLQRLL DDPAVTEIMV NRMDQIYVER KGHLTLADSR FSSEEHLRKV IERIVSKVGR
RIDESSPLVD ARLEDGSRVN AVIPPLAVGG SSLTIRKFSK VPLTVRNLID FGTLTPEMAE
LLNACVKAKL NIIVSGGTGT GKTTLLNVLS SFLPSDERIV TIEDAVELQI QQAHVVRLES
RPPNTEGKGE VTIRELLRNS LRMRPDRIVV GEVRGGESLD MLQAMNTGHD GSLSTVHSNS
PRDAVARLET LVLMAGMDLP LRAIREQIAS AVNLIVQISR LRDGSRRITH VTEVQGMEGD
IVTLQDAFVF DYSAGVDAHG RFLGRPVATG IRPRFIDRFE DLGIHVSPAV FATPGSPAGP
AGHSGQAGHS GQGIA