Gene Arth_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4500 
Symbol 
ID4443321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008537 
Strand
Start bp122570 
End bp124339 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content66% 
IMG OID639687553 
Productconjugative transfer gene complex protein 
Protein accessionYP_829250 
Protein GI116662195 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.610328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCAC CGAACCGCAA AGGCATGGGC CTGGGGGACG CCATCCTGAT CTGGCTGGCC 
ATCGGCTTCA TCGTCATCGT CGGCGGCGGC GCGTACGCCG CCGTGCACAT CGGCTCATGG
ATGGCCGGCC TGCCGAGCCC TCCGGGCCAC CCCATCGACC TGATCGCCGG CCTCGTGAAG
GGCAAAGTCA TCTGGCCGGC CCAGGCCACC GTCGTCGTGG CCGTCATGGC CGGCCTGGTC
CTCGCCCTGG CGATCCTCGT GTTCTGGGCT TGGCGGAAGG GTGCCTCCAA GCGTGCCCGG
GTCGACAAAG CCGCCCGGTA CCTCGGCCGC GGCAAGAACC TCACCGCGTT CTCCGGAAAG
GGCGCCCAAG CCACCGCAGA CCGACTGGGC GTCACCGGCA ACCCGGGCAT CGTCGTGGGC
AAGGTCGTCT CCACCGGGCA AACGTTCATC CAGTCCTGGG AAGACCTCAG CCTCGATATA
TGGGGACCCC GGACCGGTAA GTCCACCTCA CGGGTCATGC CGGCGATCCT GGACGCGCCC
GGCGCTGTCG TCTCTACCTC GAACAAGCGG GACGTCGTTG ACGGCACCCG TGGCGTCCGC
GCCGCCACCG CCCCTGTCTG GGTCTTTGAC CCGCAGAAAA TCGCTCAGGA AGAGCCCGAC
TGGTGGTGGA ACCCGCTCTC CTACGTCACC GACGAAGAGA AGGCCTACAA GCTCACACAG
CACTTCTCCG TCGGCTCCCG CGTCCCGGGG TCCAAGCCGG ACGCTTACTT CGACCCCAAG
GCCGAAGACA TACTCTCCTC CTACTTCCTC GCCGCAGCCC TCGGGGACCT GCCCATCACG
CAGGTCTACT TCTGGGTCAC CGAACAGGTG AGCCAGGAAC CAATCGAGAT CCTGAAAGAG
CACGACTACG AGCTTCAATA CCGGGGCCTG GAATCCACGC TCAAGCTCGC GGACAAGCAG
CGCGACGGAA TCTTCGGCAC CGCCGAGAAG ATGATCCAGT GCCTCAAGAG CCGCAACACC
CTCCGCTGGG TCGCCCCCAC CGGAGGAGCC ACGGTCACCA CGGACTCCCG GCGGCAATTC
AACCCCCACG CCTTCGCCGC ATCCCAGGAA ACGATCTACA TCCTTTCCAA AGAAGGTGCC
GGCTCCGCCG CCCCGCTCAC CACAGCCCTG ACAGTGGCCA TCGCAGAAGC CATGGAAGAA
CGGGCAGAAC GCAGAGGCGG ACGCCTCCCC AAGCCGGCCC TGTTTGCTCT GGACGAGCTG
GCCAACGTCG TCCGCTGGGC CGGCCTGCCG GACCAGTTCA GCCATTACGG CTCCAAAGGC
CTGATCGTCA TGGGCATCCT GCAGTCCTGG TCCCAGGGCG TCGAACTCTG GGGAGAGGCG
AACATGCGCA AGATCTGGTC CGCCGCGAAC GTCAAGGTAT ACGGCGGCGG CGTCGCCGAA
GAAGGATTCC TCCGCGCCCT CTCAGACCTC ATCGGGGACT ACAGCTACAC CAACGTCTCC
ATCAGTTCCG GCAAAACCGG GTCCAGCCGC TCCCGCCAGG AAGGGAAGGA ACGCATCTTC
GACGTATCCA ACCTCGCCGA GCTGGACCGC GGCCGCGCCG TCATCCTCGC ATCCGGCGCC
CCCGCCACAC TCGTCCGCAC CATGCCCTGG TACACCGGCA CACACAAGGA CGCTGTGGAA
GCGTCGATCA AGCAGTACAG CCCGCGCCCC GAAGAGGAAG CTGTGCCGGT CGCTGCCGTG
CCGGGCTCCA ACCCCTGGGT CACAGGCTAG
 
Protein sequence
MSAPNRKGMG LGDAILIWLA IGFIVIVGGG AYAAVHIGSW MAGLPSPPGH PIDLIAGLVK 
GKVIWPAQAT VVVAVMAGLV LALAILVFWA WRKGASKRAR VDKAARYLGR GKNLTAFSGK
GAQATADRLG VTGNPGIVVG KVVSTGQTFI QSWEDLSLDI WGPRTGKSTS RVMPAILDAP
GAVVSTSNKR DVVDGTRGVR AATAPVWVFD PQKIAQEEPD WWWNPLSYVT DEEKAYKLTQ
HFSVGSRVPG SKPDAYFDPK AEDILSSYFL AAALGDLPIT QVYFWVTEQV SQEPIEILKE
HDYELQYRGL ESTLKLADKQ RDGIFGTAEK MIQCLKSRNT LRWVAPTGGA TVTTDSRRQF
NPHAFAASQE TIYILSKEGA GSAAPLTTAL TVAIAEAMEE RAERRGGRLP KPALFALDEL
ANVVRWAGLP DQFSHYGSKG LIVMGILQSW SQGVELWGEA NMRKIWSAAN VKVYGGGVAE
EGFLRALSDL IGDYSYTNVS ISSGKTGSSR SRQEGKERIF DVSNLAELDR GRAVILASGA
PATLVRTMPW YTGTHKDAVE ASIKQYSPRP EEEAVPVAAV PGSNPWVTG