Gene Arth_0592 details

Gene Information

Locus tag: Arth_0592
Symbol:
ID: 4446943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 636881
End bp: 638176
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 66%
IMG OID: 639688390
Product: proteinase inhibitor I4, serpin
Protein accession: YP_830091
Protein GI: 116669158
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4826] Serine protease inhibitor
TIGRFAM ID:
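
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the record states neither, and the function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(636881, 638176)
    print(length)                  # 1296, matching "Gene Length"
    print(protein_length(length))  # 431, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.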


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACGC ACCGTGTGGC CAGGATCGCG ACGGCCGGTG CCCTTGCGGC GCTCACCGCA 
TGTTCGGCGT CGTCCCCGGA GCTCCTGAAG GCCGACGGCG TGGAACGGAT TTCGGTGGAC
GGCGCGGCGT ATGCCGCTGA ACTGCGCTCC TTCCGGGCCT CTGCTCTCGG GCTCGGCGAG
GCCCTGCTGG CCGACGGCGG CGACGGCTCC AAAGGGAACG TGGTCTCCTC GCCGGGGAGT
CTGCTGATTG CCCTCGCCAT GCTCCGAGCC GGGGCATCCG GCGAGACGGC GGCCGAGATG
GACAGCGTCC TGAAGCTCCC CGCGGAACAT CGCGACGAAG CCATGAACGC GCTGCTCAGC
TCGCTCGGGA AGTTCGACGG CGACCCCGGC TCGGTGGATG AGGACAACCC GCCGCGGAAG
CCCGTGATGC ACGCCGCCAA CGGATTGTTC GTGGACAAGG ACGTGCCCAC GGGCGACGCC
TTCCTGGAAA CACTGGCCCG GCACTACGGA ACCGGCGTCT ATCCCGTGGA CTTCAGCAAC
GAAGCGGCAA CCAAGCCGGC CATCGATGCC TGGGTGAACC GGAACACGGG CGGCCGGATC
AAGGAGGCCC CCGCAAAGTA CGATCGCGAC AACACCTTCA GCCTGCTCAA CTCTCTCTAC
TTCGCGTCCG CCTGGCGCGC GCCCTTTGAT CCGAATGACA CCTCGGACCT GCCCTTCACG
ACGGCTGCCG GCGAGAAGAT CGACGCCCCG GCAATGCACA ACGAGCTGGA GATGAAATAC
GCGGAGGGGG CTGGCTGGCA GGGCGTGGAC CTGCCCTACG CCGACGGTTT CGTCATGCGG
CTGGTCCTCC CGGGCACAGG CGTCCCAGGT GCGGGCACAG GCGCCGGTTC GCCGGTCTTT
GGTGCGGAAC GGCTCACGGA CATTGCGGAT GCTTTTGACC GCGCACCGCT GGAGACCGTG
CAGATCCAGC TGCCCCGCTG GGACCATAAG TGCAGCTTCG ACCTGAGGAA AGTCTTTGAA
TCCCTGGGGC TCCGGAAGAC CCTCACCACC ACGGAGGACT TCAACAACAT CCAGCCCGGG
ATGATGATCA CCCAGGCCGC CCAGGCAGCC AACATCACGG TCGCGGAAAA GGGCACGGTT
GCCGCGGCTG TCACCCAGAT CAACGGAGCT GTCACCAGCG CGCCGCCCCA GCCCGAACGA
ACCATCACGT TCGACCGGCC GTTCCACTAC CAGATTGTGC ACGTCGAAAC CGGGCTGCCG
CTCTTCATGG GAACGGTGGC CGACCCCCGT TCCTAG
 
Protein sequence
MKTHRVARIA TAGALAALTA CSASSPELLK ADGVERISVD GAAYAAELRS FRASALGLGE 
ALLADGGDGS KGNVVSSPGS LLIALAMLRA GASGETAAEM DSVLKLPAEH RDEAMNALLS
SLGKFDGDPG SVDEDNPPRK PVMHAANGLF VDKDVPTGDA FLETLARHYG TGVYPVDFSN
EAATKPAIDA WVNRNTGGRI KEAPAKYDRD NTFSLLNSLY FASAWRAPFD PNDTSDLPFT
TAAGEKIDAP AMHNELEMKY AEGAGWQGVD LPYADGFVMR LVLPGTGVPG AGTGAGSPVF
GAERLTDIAD AFDRAPLETV QIQLPRWDHK CSFDLRKVFE SLGLRKTLTT TEDFNNIQPG
MMITQAAQAA NITVAEKGTV AAAVTQINGA VTSAPPQPER TITFDRPFHY QIVHVETGLP
LFMGTVADPR S
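
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 and last 15 bases of the gene sequence are used here.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G); the same string
# applies to translation table 1 and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGAAAACGC ACCGTGTGGC CAGGATCGCG"))  # MKTHRVARIA
    # Last 15 bases, ending in the TAG stop codon.
    print(translate("GACCCCCGT TCCTAG"))                   # DPRS
```

The two fragments reproduce the first ten residues (MKTHRVARIA) and the last four residues (DPRS) of the protein sequence listed above.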