Gene Arth_4108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4108 
Symbol 
ID4447698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4628077 
End bp4629444 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID639691939 
Productputative secreted protein 
Protein accessionYP_833583 
Protein GI116672650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC AGCATAGCGT CAATCGCCGC CGCCTTAGAC CCTCTTTTCC TCCACTTCGC 
CCGGCGCGGC GGCCCGGTTT GAGGGTTTCC CCTATCGGCG TTCCGGGAGC GGCGGGACTT
GCCGTTGCCG CCCTCGTCCT GAGCGCTTGC GCCGCGGACC CGGGGCGGGG CGCTGACGGC
GCCCAGTCGG GAGGCGGCGC GTCATCCGCC GGGCAGTCCA CCGGGTCGTC CAGTGCAAGC
CAGGAGGTCC CGGGCCCCGC GCTCAGGCTG GTATTGACTC ATGCCGGGGG CATCACGGTG
CTCGATGCCA CCTCACTGGA GGTCGTGGGG GAGGCCGAGC TTACGGGCTT CAACAGATTG
AACCCTGCGG GCGACGGCCG GCATGTCCTG GTGTCCACCG GAAACGCGTT CAGGGTGTTC
GATGCGGGCG TGTGGACGGA GAAGCATGGT GATCACGGGC ATTCCTACGC CGTTGAGCCG
TCCCTCACGG CTGCCTCGTT CGAAGCCAGC AAGGCCGGCC ACGCTGTCTT TCATTCCGGC
CGGACCGCCC TCTTCAGCGA TGGTTCCGGA AAGGTGGAGC TCTTCGACCC GGCAGGACTG
GGGGAGAGTG CGGGTGTCCT TCCGGACAGT GATGTCTACA CGACTGCGGA AGCCCATCAT
GGCGTAGCGG TGCCGCTCGA GGCGGGCAAA CTCCTCGTCA CGGTCGGAGA CGAGGAGTCG
AGGCGGGGGA TTGCCGTGCT GGCAGCGGGA GCGGGGCAGG ACCGCGCGGA ACTGGTGCGC
AACGAGGACT GCCCCGGCGT GCACGGCGAA GCAGCCGCAG GCCCGGATAC AGTCGTGGTG
GGCTGTGAAG ACGGCATGCT GATCTACCGG GACGGGAAGA TTTCCAAAGT GGCAAGCCCC
GACGCCTACG GGCGGATGGG AAACCAGGCG GGTTCGCCCA GGTCGCCGGT GGTTCTGGGT
GACTACAAAG TGGATAAGGA TGCTGCGCTG GAACGGCCCA CGCGCGTCTC GCTCGTCAAT
ACGGAGACGG CCACCCTCCG GCTGGTGGAA CTCGGTACGA GCTATTCGTT CCGCTCGCTG
GGCAGGGGTG CCGCCGGCGA GGCCCTGGTC CTGGGGACCG ACGGCGCCCT GCGTGTCATT
GACCCATTGA CCGGAAGCAT CACCTCCACC ATCCCCGTCG TTGACGCCTG GGAGGAATCG
GAAACGTGGC AGGACCCGCG CCCGACGCTG TTTGTGCAGG GCTCCACCGC CTACGTCACG
GAGCCTGCAG AAAGCGCGAT CCATGCCGTG GACCTTGCCT CGGGCAAAGT AACCAAATCG
GCGGAACTCG CGCACGTGCC CAACGAGCTG ACGGGAGTCT CGGGCTAG
 
Protein sequence
MKNQHSVNRR RLRPSFPPLR PARRPGLRVS PIGVPGAAGL AVAALVLSAC AADPGRGADG 
AQSGGGASSA GQSTGSSSAS QEVPGPALRL VLTHAGGITV LDATSLEVVG EAELTGFNRL
NPAGDGRHVL VSTGNAFRVF DAGVWTEKHG DHGHSYAVEP SLTAASFEAS KAGHAVFHSG
RTALFSDGSG KVELFDPAGL GESAGVLPDS DVYTTAEAHH GVAVPLEAGK LLVTVGDEES
RRGIAVLAAG AGQDRAELVR NEDCPGVHGE AAAGPDTVVV GCEDGMLIYR DGKISKVASP
DAYGRMGNQA GSPRSPVVLG DYKVDKDAAL ERPTRVSLVN TETATLRLVE LGTSYSFRSL
GRGAAGEALV LGTDGALRVI DPLTGSITST IPVVDAWEES ETWQDPRPTL FVQGSTAYVT
EPAESAIHAV DLASGKVTKS AELAHVPNEL TGVSG