Gene Arth_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4149 
Symbol 
ID4447617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4671586 
End bp4672671 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content68% 
IMG OID639691980 
Producthypothetical protein 
Protein accessionYP_833624 
Protein GI116672691 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTCTCG CTATTCGCGA TCTCCGCTTT GCCAAAGGAC GCTTTGCCCT GATGGCAGCC 
GTTATCGCCC TGATCACCTT GTTGCTCGTG ATGCTCTCTG GCCTCACCGC CGGTTTGGGA
AACCAGTCGA CGTCGGCAAT CACCGCGCTG CGGGCCGACC AGATCGTGTT CGGAGCCCCC
GCCGGCACTC CGGCCAAGGC ATCCTTCACC GAATCGGAAG TGAGCCGCGA CCAGCTGGCC
GCCTGGTCGG GACGGGACGG GGTTTCGGGG GTGGAGGCGC TCGGCATCAG CCAGGCCCGC
GCTCAGGCCG TTGGCCCCGC CGGAGCCCCG GGCGGCACCG CAAACGTGGC CGTCTTCGGA
TCCGGAAACG GAAACTCGGG GGACCCGGAG GACGGAACAG TAGTGGTGGG CGAAACCCTC
GCTGCGGACC TGCACCTGAG CCCCGGCAGC CGCCTTGCGG TGGGCGGGGC GGAACTTGCC
GTTGCGGACA TCGTCCCGGA CGAGTGGTAC TCGCATACCG GCGTCATCTG GACGTCGCTG
AACGACTGGC GGCAATTGGC CCGCGCAGGC AACGGATCAC TCGGCACCGT GCTGGCCGTA
ACGTTCGACG CCGGCGCCCG GGTTGACGTG GACGCCGCCA ACGCGGCGGC GGGAACAGTC
AGCGCCACCC GTGAAGGCTC GTTCCAGGCG CTGGGGTCGT TCAAAAGCGA AAACGGCTCG
CTGGTGCTGA TGCAGGCGTT CCTGTACGGC ATCTCGGCCC TGGTGATCGT GGCGTTCCTG
ACGGTATGGA CTGTTCAGCG GACCCGCGAC ATTGCCGTCG TCAAGGCAAT GGGCGGGTCC
CCGGGGTATG TGCTCCGCGA TGCGATGGCG CAGGCCGGGA TGGTGCTGGC AGCAGGGACG
GTTACCGGCG GCGGAGCAGG ACTGCTCGGC GGGATTTTTG CGGCACAGGC TGCCCCGTTC
CTGGTCACAC CGGACACCAC GCTCGTTCCC ATTGCCGGAA TCCTGCTCCT GGGCCTCAGC
GGAGCCGTCG TGGCGGTCCG CGGCGTTACC CGGGTTGACC CGCTACTTGC CCTCGGCGGC
AACTGA
 
Protein sequence
MFLAIRDLRF AKGRFALMAA VIALITLLLV MLSGLTAGLG NQSTSAITAL RADQIVFGAP 
AGTPAKASFT ESEVSRDQLA AWSGRDGVSG VEALGISQAR AQAVGPAGAP GGTANVAVFG
SGNGNSGDPE DGTVVVGETL AADLHLSPGS RLAVGGAELA VADIVPDEWY SHTGVIWTSL
NDWRQLARAG NGSLGTVLAV TFDAGARVDV DAANAAAGTV SATREGSFQA LGSFKSENGS
LVLMQAFLYG ISALVIVAFL TVWTVQRTRD IAVVKAMGGS PGYVLRDAMA QAGMVLAAGT
VTGGGAGLLG GIFAAQAAPF LVTPDTTLVP IAGILLLGLS GAVVAVRGVT RVDPLLALGG
N