Gene Achl_4444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_4444 
Symbol 
ID7280012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011879 
Strand
Start bp384669 
End bp386066 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content64% 
IMG OID643580398 
Producttype II secretion system protein E 
Protein accessionYP_002478212 
Protein GI219883048 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value0.00432597 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTCG CTGACCGTCT GGACGCCGCC GCCGGCATCA CGCCAAGCCC GGCAGCCGAA 
CAACCCAAAG CCCCGGCCAC AGCCGCTCCA GTCAAGGAAA CAGCGCCGCC CCCGGACGCC
CTCTCAGGCC TGAAACAGCG TGCCGGCAAC GCCCTGTTCG AACGGATCGG CAGCCGCCTG
AACGACCCGG CAATGGGCGA AGACGAACTG CTGGCCTACG CGCGCGAGGA ACTTACCCAG
ATCGTGGACG CCGAGAGTGT CCCGCTCACC GGGGATGAAA AACAGCGGCT CGTCAGCCAG
ATCAGCGACG ACGTCATGGG CTACGGACCC CTGCAGAAAC TCCTGGACGA CGAGAGTGTC
TCGGAAATCA TGGTCAACGG ACCGGACCAG ATCTTCGTTG AACAAAACGG AAGGGTCACC
GAAAGCGACG CGCGCTTCAA AACCGAAGAC CACCTCCGCC GGGTCATCGA GAAGATTGTC
TCCCGCGTCG GGCGCCGCAT TGACGAGTCC TCGCCCATGG TGGACGCACG CCTCGCCGAC
GGTTCCCGCG TCAACGCCGT CATCCCGCCG CTGGCCGTCA ACGGATCATC ACTGACCATT
CGAAAGTTCG CCGCTGACCC GCTCAAGGCC GCAGACCTGG TCCGTTTCGG GTCCATCTCA
CCGGAGATGG CCGAGTTGCT GGATGCCTGC GTCAAAGCCC ACCTGAACAT CATCGTCTCG
GGCGGCACGG GGACCGGGAA GACCACCCTG CTGAACGTCC TCTCCTCCTT CATCCCGCCG
GGGGAGCGGA TCGTCACTAT CGAGGACGCC GTCGAGCTGC AGCTCCAGCA GAACCACGTT
GTGCGCCTGG AAAGCCGCCC CTCCAACGTG GAAGGCAAGG GGGAGATCAC CATCCGGGAC
CTGCTCAAGA ACTCGCTCCG TATGCGCCCT GACCGAATCG TCGTGGGCGA GTGCCGCGGC
GGGGAGGCCC TGGACATGCT GCAGGCAATG AACACCGGCC ACGACGGCTC CTTGTCCACC
ATTCACTCCA ATTCGCCCAG GGACGCGATC TCCAGGATGG AGACCCTGGT GCTGATGGCC
GGCATGGACC TGCCCCTGAG GGCTGTCCGC GAACAGATCG CTTCCGCCGT GGATGTCATC
GTGCAGCTGA CCCGGCTGCG TGACGGCAGC CGCCGCGTCA CACACGTAAC GGAAGTCCAG
GGCATGGAAG GCGACGTGGT CACCCTTCAG GACGCCTTCG TCTTTGACTA CAGCGCAGGC
GTCGACGAAG ACGGACGGTT CCTGGGCAAG CCGGTCTCCA CAGGCGTCCG CCCGAAATTC
ACGGACAAGT TCAACGACCT GGGCATCAAG CTTTCCCCCT CAGTGTTCGG CGTCCCCAGC
ATGGCCGGGA GCCGGTAA
 
Protein sequence
MNLADRLDAA AGITPSPAAE QPKAPATAAP VKETAPPPDA LSGLKQRAGN ALFERIGSRL 
NDPAMGEDEL LAYAREELTQ IVDAESVPLT GDEKQRLVSQ ISDDVMGYGP LQKLLDDESV
SEIMVNGPDQ IFVEQNGRVT ESDARFKTED HLRRVIEKIV SRVGRRIDES SPMVDARLAD
GSRVNAVIPP LAVNGSSLTI RKFAADPLKA ADLVRFGSIS PEMAELLDAC VKAHLNIIVS
GGTGTGKTTL LNVLSSFIPP GERIVTIEDA VELQLQQNHV VRLESRPSNV EGKGEITIRD
LLKNSLRMRP DRIVVGECRG GEALDMLQAM NTGHDGSLST IHSNSPRDAI SRMETLVLMA
GMDLPLRAVR EQIASAVDVI VQLTRLRDGS RRVTHVTEVQ GMEGDVVTLQ DAFVFDYSAG
VDEDGRFLGK PVSTGVRPKF TDKFNDLGIK LSPSVFGVPS MAGSR