Gene Arth_4279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4279 
Symbol 
ID4443530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp13213 
End bp14520 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content63% 
IMG OID639687600 
Producthypothetical protein 
Protein accessionYP_829297 
Protein GI116662243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACT GCTCCGGGTT GAATGCAATA AATCCCTTTT GCCAGGCAGG GGCGGCCATC 
GAAGATGTCG CCAATGATGC CGTCAAGAAC ATGGCGAAAG CGATTGCCGA TGCCGTTGGA
CAAACAGTCC AAACACTCGG GACATTTTGG GTGAACGTGG GAACTCCCGC TTTAACCGCA
GCCCCCGGAG GGTCCACCGC GAGTGACCCG GTGTTGTTCC TGCAGAACAG TCTCTACTTC
TGGACGGCTT CCCTGGCTGT GATGTCTGTC CTCGTGGGTG CAGCCAAGAT GACGATTGAG
CGCAGGGGTG CACCCTTGCG GGATCTGGTG CGTTCGCTGG CGACCCTAAT CGTCGTCTCC
GGCGCCGGTG TGGCAGCAGT GGGACTGCTG ACAGTCGCGG CAGATCAGTT CTCAGCCTGG
ATCATCACCA ACTCCACGAA CGGGACCTCG TTCAACGAAA ACATCACCGC CCTATTGGCG
CTCTCGGCCA CCAGCCCGAT CGGGTCGATC ATGATCATCC TCCTGGGGCT CATCGCCATT
CTGGCCTCCG TCATGCAGAT CGTGCTAATG ATCATCCGCG GTGGCCTGCT CGTCATCCTG
ACAGGGATCT TCCCTACGGC TGCCGCTTTC AGCAACACCG AGGCCGGCAA GGGTTGGTTC
CAGAAGTGCA CAGCATGGCT GATCGCTTTC ATCCTCTACA AACCCGCGGC CGCCATTATC
TACGCGACGG CCTTCCAGCT CAGCGGCACC AAGATCTTCG GAAACGTTGG TGACGGCAAG
GACTTCGGCT CCGCGCTCCT GGCAACGGTC ACCGGACTGG CTCTGATGAT CATTGCCTTG
TTCGCCATGC CAGCCCTCAT GCGGTTCGTC ACCCCGATGG TTGGCGCTGT CGCTGGCGGC
GGTGGCGCTC TGGCAGCAGG GACGGTCGGC GCACTGGCTT CCGGCGCCAT CAGCATGGGC
ACCGCCGGAC GCGGCGGCGG ATCCTCCACC AGCTCGACGA CAACCAGCAG CACCTCGAGC
CAAAGCCCCG GATCGCAGGG CCCCTCGGGC ACTGGCAGCC AGGGAACGGC AGGAACGTCC
GGCACAGCGG GGAAGACAGG TACCCGGAGC GCCGGAGCCA CTGGCGCGGC AGCACCTACC
GGTACCGGTA GTGCTGCGGC AGGCAGCGGA GCAGCCGCCA GCGGTGGCGC TGTGGCGGCG
GGCGCGGGCG GTGTCGCCGT GCTGGCAGCA CAGAAGGGCA TTGAAGCAGG CCAAGCCGCT
TCCGGGGCCA TCAAAGACAT GAGCGAAGAA TCCACGGGGG GTGCCTGA
 
Protein sequence
MADCSGLNAI NPFCQAGAAI EDVANDAVKN MAKAIADAVG QTVQTLGTFW VNVGTPALTA 
APGGSTASDP VLFLQNSLYF WTASLAVMSV LVGAAKMTIE RRGAPLRDLV RSLATLIVVS
GAGVAAVGLL TVAADQFSAW IITNSTNGTS FNENITALLA LSATSPIGSI MIILLGLIAI
LASVMQIVLM IIRGGLLVIL TGIFPTAAAF SNTEAGKGWF QKCTAWLIAF ILYKPAAAII
YATAFQLSGT KIFGNVGDGK DFGSALLATV TGLALMIIAL FAMPALMRFV TPMVGAVAGG
GGALAAGTVG ALASGAISMG TAGRGGGSST SSTTTSSTSS QSPGSQGPSG TGSQGTAGTS
GTAGKTGTRS AGATGAAAPT GTGSAAAGSG AAASGGAVAA GAGGVAVLAA QKGIEAGQAA
SGAIKDMSEE STGGA