Gene Arth_2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2642 
Symbol 
ID4444729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2962496 
End bp2963566 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID639690462 
Producthypothetical protein 
Protein accessionYP_832121 
Protein GI116671188 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACT GGGTAGGAAT TATCTGGCTG GTGCTCCTGC TGATCGGAAA CGCCTTCTTC 
GTGGCTGCGG AATTCGCGGT GATGTCCGCG CGGCGGAGCC AGATTGAGCC GCTGGCCGAA
GCAGGGTCCC TGCGGGCCCA GACAACGTTG CGGGCCATGG AAAGCGTGTC CCTCATGCTC
GCGTGCGCCC AGCTGGGCAT CACGGTCTGC TCCCTGCTGA TCCTGCAGGT GGCTGAGCCG
GCCATCCACC ACCTGATGGC CGTGCCGCTG GAAGCCGTGG GCGTGCCGAC GGAACTCGCG
GACGTTGTGG CGTTCGCCGT GGCGCTCCTG GCGGTGACCT TCCTTCACGT GACCTTCGGC
GAGATGGTGC CCAAGAACAT CTCGGTCTCC GTCGCGGACA AGGCGGCACT GCTGCTGGCG
CCGCCGCTGA TGTTCATCGC ACGCCTTGTG AACCCGGTGA TCGTGGCCCT CAACTGGTCT
GCCAACCACA TCCTGCGCCT GCTCCGGATC GAGCCCAAGG ACGAGGTCAA CTCCTCGTTC
ACCCTGGAGG AGGTCCAGTC CATCGTGCAG GAATCCACCC GGCATGGACT CGTGGATGAC
GACGCCGGCC TCATCACCGG TGCACTGGAA TTCTCCGAGC ACACGGCGTC CCGCATCATG
GTTCCGCTGG ACAAGCTGGT CATGATGCAG TCGCCCACCA CGCCGGTGGA GTTTGAAAAA
GCCGTCAGCC GCACGGGGTT CTCCCGGTTC CCGATGATGG ACGAGGACGG GATGCTTTCG
GGCTACCTTC ACATCAAGGA TGTGCTGTCC ATCCCGGAAG CCGGATATGA GCACCCGATT
GCGGAAAGCC GCATCAGGTC CCTGGCGAAC CTGTCCATGG ATGACGAGAT CGAGAAGGCG
ATGTCCGTGA TGCAACGCAC CGGCTCGCAC CTGGCGCGCG TCATCGGACC GGACGGCAAC
ACCAGGGGCG TCCTTTTCCT GGAAGACGTG ATCGAACAGC TCGTGGGCGA GATCCGTGAC
GCCACCCAGG CGAAGGGAAT CCGCCGGCTC GGCCAGCGGA ACGGCGACTA G
 
Protein sequence
MSDWVGIIWL VLLLIGNAFF VAAEFAVMSA RRSQIEPLAE AGSLRAQTTL RAMESVSLML 
ACAQLGITVC SLLILQVAEP AIHHLMAVPL EAVGVPTELA DVVAFAVALL AVTFLHVTFG
EMVPKNISVS VADKAALLLA PPLMFIARLV NPVIVALNWS ANHILRLLRI EPKDEVNSSF
TLEEVQSIVQ ESTRHGLVDD DAGLITGALE FSEHTASRIM VPLDKLVMMQ SPTTPVEFEK
AVSRTGFSRF PMMDEDGMLS GYLHIKDVLS IPEAGYEHPI AESRIRSLAN LSMDDEIEKA
MSVMQRTGSH LARVIGPDGN TRGVLFLEDV IEQLVGEIRD ATQAKGIRRL GQRNGD