Gene Arth_4219 details

Gene Information

Locus tag: Arth_4219
Symbol:
ID: 4443585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008539
Strand:
Start bp: 52400
End bp: 54172
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 66%
IMG OID: 639687744
Product: conjugative transfer gene complex protein
Protein accession: YP_829441
Protein GI: 116662388
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
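
The coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(52400, 54172)
print(length, protein_length_aa(length))  # 1773 590
```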


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGCAC CGAACCGTAA GGGAATGGGC CTGGGGGATG CCCTGCTGGT CTGGTTCGCC 
ATCGGATTCA TCGTCATCGT CGGCGGCGGG ACTTACGCGG CCGTCCACCT GGGATCTTGG
ATGGCCGGCA TAGACGCACC CCCCAAACAT CCCATCGACC TGATCGCCGG GCTGGTCAAG
GGCCGCGTGC CGTGGCCCGT CCAGTCCACC GTCGCGGCCG CCCTCATGGC CGGCGTGGTC
CTGGCCCTGA CCATCGTCGT TCTCGTGGCC TGGAGGAAGG GTGCCTCCAA GCGTGCCCGC
GTCGACAAGG CCGCACGGTA CCTGGGTCGA GGGAAGTCCT TGGCTGCGTT CTCCGAGAAG
GGCGCGAAAG CGACGGCGGA TCGGCTGGGA GTGACAGGCA CGCCGGGCAT CGTGGTGGGC
AAGGTTGTCT CCACCGGCCA GACGTTCATT CAGTCCTGGG AAGACCTCAG CCTCGATATC
TGGGGGCCGC GTACCGGTAA GTCGACCTCA CGGGTTATGC CCGCGATCCT GGACGCACCG
GGTGCTGTGG TGTCGACCTC GAACAAGCGT GACGTGGTGG ACGGCACCCG TGGCGTCCGC
GAGCTCACGG CCCCGGTATG GGTGTTCGAT CCGCAGAAGA TCGCGCAGGA GGAAGCACAG
TGGTGGTGGA ACCCGCTCTC TTACGTCACC GACGAAGAGA AGGCCTACAA GCTCACGCAG
CACTTCTCGG TTGGGTCGCG GGTCCCGGGT TCCAAGCCGG ATGCCTACTT CGACCCCAAA
GCCGAAGACA TCCTCTCCTC GTACTTCCTC GCGGCCGCCC TCGGGGGGCT GCCCATTACG
CAGGTGTACC TGTGGGTGAC GGAGCAGGTA AACCGGGAAC CAATCAACAT CCTGAAAGAG
CATGACTACG AGCTGCAGTA CAAGGGCCTG GAGTCCACGC TGGAGCTGGC CGACAAACAG
CGCGACGGCA TCTTCGGCAC GGCCGAGAAG ATGATCCAGT GCCTCAAGAG CAGGAACACG
CTGCGCTGGG TCGCTCCCAT GGGCGGTGCG ACAGTGGCCA CGGATACCCG CCGGCAGTTC
AACCCGCACG CTTTCGCCGC CTCCCAGGAG ACGATCTACA TCCTCTCCAA GGAAGGGGCC
GGCTCCGCCT CCCCGCTCAC CACAGCGTTG ACGGTGGCGA TCGCCGAGGC GATGGAGGAA
CGGGCCGAAC GCAGCGGCGG CCGCCTGCCC AGACCGGCAC TGTTCGCCCT CGATGAGCTG
GCCAACGTCG TCCGCTGGGC AGCCCTGCCG GACCAGTTCA GCCACTACGG CTCCAAGGGC
CTGATCGTCA TGGGCATCCT GCAGTCCTGG TCACAAGGCG TTGAACTGTG GGGCGAGGCG
AACATGCGGA AAATCTGGTC AGCCGCGAAC GTCAAGGTCT ACGGCGGCGG CGTCGCCGAA
GAAGGCTTCC TACGCGCCCT CTCGGACCTG ATCGGGGACT ACAGCTACAC CAACGTCTCC
GTCTCCTCCG GCAAGTCCGG CTCCAGCCGC TCCCGCCAGG AGGGCAAGGA ACGCATCTTC
GATGTCTCCA ACCTCGCGGA GCTGGACCGT GGCCGCGCCG TGGTCCTCGC CTCCGGCGCA
CCCGCCACGC TGGTCCGGAC CATGCCCTGG TACACCGGCC GGCACAAGGA AGCCGTGGAG
GCGTCCATCA GGAAGTACAG CCCCCGCCCG GAGGAACCGG AGGCCGTCCC GGTGCCCGCC
GCTCCGGTCG CTAATCCCTG GGTCACCGGG TAA
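
GC content is the fraction of bases in a sequence that are G or C. The sketch below shows one way to compute it from a sequence printed in space-separated blocks of ten bases, as above. It is only an illustration: the helper name `gc_fraction` is hypothetical, and how the database derived and rounded its own figure is not stated in this record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence: 11 of 20 bases are G or C.
print(gc_fraction("GTGAGTGCAC CGAACCGTAA"))  # 0.55
```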
 
Protein sequence
MSAPNRKGMG LGDALLVWFA IGFIVIVGGG TYAAVHLGSW MAGIDAPPKH PIDLIAGLVK 
GRVPWPVQST VAAALMAGVV LALTIVVLVA WRKGASKRAR VDKAARYLGR GKSLAAFSEK
GAKATADRLG VTGTPGIVVG KVVSTGQTFI QSWEDLSLDI WGPRTGKSTS RVMPAILDAP
GAVVSTSNKR DVVDGTRGVR ELTAPVWVFD PQKIAQEEAQ WWWNPLSYVT DEEKAYKLTQ
HFSVGSRVPG SKPDAYFDPK AEDILSSYFL AAALGGLPIT QVYLWVTEQV NREPINILKE
HDYELQYKGL ESTLELADKQ RDGIFGTAEK MIQCLKSRNT LRWVAPMGGA TVATDTRRQF
NPHAFAASQE TIYILSKEGA GSASPLTTAL TVAIAEAMEE RAERSGGRLP RPALFALDEL
ANVVRWAALP DQFSHYGSKG LIVMGILQSW SQGVELWGEA NMRKIWSAAN VKVYGGGVAE
EGFLRALSDL IGDYSYTNVS VSSGKSGSSR SRQEGKERIF DVSNLAELDR GRAVVLASGA
PATLVRTMPW YTGRHKEAVE ASIRKYSPRP EEPEAVPVPA APVANPWVTG
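
The gene sequence begins with GTG while the protein begins with M. Translation table 11 (the bacterial, archaeal and plant plastid code) uses the standard codon assignments but permits alternative start codons such as GTG, which is read as methionine at the initiation position and as valine elsewhere. The sketch below illustrates this under those rules; the function name `translate_cds` is hypothetical, and the code is a simplified illustration rather than the method the database used.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading any allowed start codon as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # the initiator codon is always read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGAGTGCAC CGAACCGTAA GGGAATGGGC"))  # MSAPNRKGMG
```

Note that an internal GTG still translates to valine: `translate_cds("ATGGTGTAA")` returns `MV`.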