Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2933 |
Symbol | |
ID | 4444455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3301908 |
End bp | 3303395 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639690756 |
Product | type II secretion system protein E |
Protein accession | YP_832412 |
Protein GI | 116671479 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.242377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGCC CGGAACTGCC CACCCGGCAG CCGGAACCGT GGACACCGCC AGCACCGTCA AAGCGGACGA CGACGGCGGC ACCGGCTGCC GCCGTCGTCG TTCCTCCAGC GGTCACAGAC CACGCGGATG AGGTGAAACC GAAAGTCCAG CAGCCCGTGG ACGCGTTCGC CGCGTTGAAG GGAAGGGCGG CCACGGCGCT GTTCGAACGC ATGGGTTCGC GTTTCAACGA CTCTGCCATC ACCGAACTGG AGCTCCGGAC GTCCGCCCGG GAAGAGCTCA CCCGCATCAT CGATGCCGAG CAGGTGCCGC TCTCCTCGGA GGAACGCACG CGCCTGGTGC AGGACGTTGC CGACGACGTG CTGGGCTACG GCCCCCTGCA GCGGCTGCTG GACGACCCCG CCGTCACCGA AATCATGGTC AACCGGATGG ACCAGATCTA CGTGGAGCGC AAGGGCCACC TCACCCTGGC CGACTCCCGC TTCAGCTCGG AAGAACACCT GCGGAAGGTC ATCGAGCGCA TCGTGTCCAA GGTGGGCCGC CGTATTGACG AGTCCTCTCC CCTGGTGGAT GCCCGGCTGG AGGACGGCTC CCGCGTCAAC GCAGTGATCC CGCCACTGGC CGTGGGCGGA TCGTCACTGA CCATCCGGAA GTTCAGCAAG GTTCCCCTCA CCGTCCGGAA CCTGATCGAC TTCGGAACGC TGACTCCGGA AATGGCGGAG CTCCTGAACG CGTGCGTCAA AGCCAAGCTC AACATCATCG TCTCGGGCGG AACGGGAACC GGTAAAACCA CCCTGCTCAA CGTGCTTTCG TCCTTCCTGC CCTCGGACGA ACGGATCGTC ACCATCGAGG ACGCCGTGGA ACTGCAGATC CAGCAGGCGC ACGTGGTCCG GCTGGAGAGC CGTCCGCCCA ACACCGAGGG CAAGGGCGAG GTGACCATCC GTGAACTGCT GCGCAACTCG CTGCGTATGC GCCCCGACCG CATCGTGGTG GGCGAGGTCC GCGGCGGCGA GTCACTGGAC ATGCTCCAGG CGATGAACAC CGGCCACGAC GGTTCTCTCT CCACGGTGCA CTCCAATTCC CCGCGTGACG CCGTGGCGCG CCTTGAAACC CTGGTGCTCA TGGCCGGGAT GGACCTGCCG CTGCGGGCCA TCCGGGAACA GATAGCCTCG GCGGTGAACC TGATCGTCCA GATTTCACGG CTGCGGGACG GCAGCCGCCG CATCACCCAC GTGACCGAGG TGCAGGGCAT GGAAGGCGAC ATCGTCACCC TGCAGGACGC CTTCGTTTTC GACTACTCGG CAGGCGTCGA CGCCCACGGC CGCTTCCTCG GACGGCCGGT TGCCACCGGG ATCCGCCCGC GTTTCATCGA CCGCTTCGAG GACCTCGGAA TCCACGTTTC CCCTGCCGTT TTCGCCACGC CCGGCAGCCC GGCCGGTCCT GCGGGCCACT CCGGCCAGGC GGGCCACTCC GGCCAGGGAA TCGCGTAG
|
Protein sequence | MDRPELPTRQ PEPWTPPAPS KRTTTAAPAA AVVVPPAVTD HADEVKPKVQ QPVDAFAALK GRAATALFER MGSRFNDSAI TELELRTSAR EELTRIIDAE QVPLSSEERT RLVQDVADDV LGYGPLQRLL DDPAVTEIMV NRMDQIYVER KGHLTLADSR FSSEEHLRKV IERIVSKVGR RIDESSPLVD ARLEDGSRVN AVIPPLAVGG SSLTIRKFSK VPLTVRNLID FGTLTPEMAE LLNACVKAKL NIIVSGGTGT GKTTLLNVLS SFLPSDERIV TIEDAVELQI QQAHVVRLES RPPNTEGKGE VTIRELLRNS LRMRPDRIVV GEVRGGESLD MLQAMNTGHD GSLSTVHSNS PRDAVARLET LVLMAGMDLP LRAIREQIAS AVNLIVQISR LRDGSRRITH VTEVQGMEGD IVTLQDAFVF DYSAGVDAHG RFLGRPVATG IRPRFIDRFE DLGIHVSPAV FATPGSPAGP AGHSGQAGHS GQGIA
|
| |