Gene Snas_6447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_6447 
Symbol 
ID8887673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6800313 
End bp6801518 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID 
ProductArginine deiminase 
Protein accessionYP_003515156 
Protein GI291303878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.762737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACA CGGAGATCAC TCACCAGGTG AGCAGCGAGG TGGGGCGGCT GCGAACAGTG 
CTTTTGCATC GCCCCGGGCC CGAGCTCAAG CGACTGACTC CACGAAACAA CGACTCATTG
CTGTTTGACG GTATCCCGTG GGTCGACAGG GCGCAAGAAG AGCACGACGC CTTCGCTGAG
GCCTTGCGCA CCCACGGCGT GGAGGTCCTG TACGTCGGTG AGCTGCTGCG CGAGACGCTC
GCGGTGCCGC ACGCGCGCAA GTCGGCGGTG GACTCGGTGC TGGACGACCG GCGGCTGGGC
GACTCGCTGC GGGCCGCCGC GCACACCCAT CTCGACGACC TCGACGCCGA GCAACTGGCG
ACCACGCTGA TCGCCGGGCT GTCGCACGAG GAGCTGCGGG GCGGCGACGG GCTGGTCTAC
ACCATGCTCG ACCAGCACGA CTTCGTCATC GACCCGCTGC CGAACCTGCT GTTCACCCGG
GACTCCTCGG TCTGGATCGG CGACCGGGTC GCGGTGACCT CGCTGGCCAT GGACGCGCGT
CGCCGCGAGA CCACGCTGAC CGACCTGATC TACACCCACC ACCCCCGCTT CGCGGGCACC
CGCAAGGTCT ACGAGCCCTC GCTGGAACAC GTCGAGGGCG GCGACGTGCT GCTGCTGGCC
CCGGGCGTGG TGGCCGTGGG CGTCGGCGAA CGCACCACCC CGTCGGGCGC CGAGCGGCTG
GCCCGCCGCA TCTTCGACGC CGGCCTTGCC CACACGCTGC TGGCGGTCCC GATCGCCCAG
GAACGCGCCA CCATGCACCT GGACACGGTG TGCACGATGG TCGACGTCGA CGCGATGGTC
ATGTACCCGA AGGTCGCCGA CACCCTGGTG GCCTACACCG TCACCAGCCA GGACGGCCAG
CCGGTGGTCG CGGGCCCGTC CCCGTTCCTG TCGGCGGCGG CCAAGGCGAT GGGCATCGAC
GCCGTCCGGC TCATCGACAC CGGCCTGGAC CCGGTCACGG CCGAACGCGA ACAGTGGGAC
GACGGCAACA ACACCCTCGC CATCGCGCCC CGGCTGTGCG TCGCGTACGA GCGCAACACC
GAGACCAACC GCCGCCTGTC GCAGGCCGGC ATCGAGGTCA TCGAGATCGC CGGTTCCGAA
CTCGGTTCGG GCCGCGGCGG CCCCCGCTGC ATGTCCTGCC CGATCGTGCG CGAGGACCTC
GGCTAA
 
Protein sequence
MADTEITHQV SSEVGRLRTV LLHRPGPELK RLTPRNNDSL LFDGIPWVDR AQEEHDAFAE 
ALRTHGVEVL YVGELLRETL AVPHARKSAV DSVLDDRRLG DSLRAAAHTH LDDLDAEQLA
TTLIAGLSHE ELRGGDGLVY TMLDQHDFVI DPLPNLLFTR DSSVWIGDRV AVTSLAMDAR
RRETTLTDLI YTHHPRFAGT RKVYEPSLEH VEGGDVLLLA PGVVAVGVGE RTTPSGAERL
ARRIFDAGLA HTLLAVPIAQ ERATMHLDTV CTMVDVDAMV MYPKVADTLV AYTVTSQDGQ
PVVAGPSPFL SAAAKAMGID AVRLIDTGLD PVTAEREQWD DGNNTLAIAP RLCVAYERNT
ETNRRLSQAG IEVIEIAGSE LGSGRGGPRC MSCPIVREDL G