Gene Snas_2501 details


Gene Information

Locus tag: Snas_2501
Symbol:
ID: 8883696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 2643144
End bp: 2644364
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: band 7 protein
Protein accession: YP_003511276
Protein GI: 291299998
COG category:
COG ID:
TIGRFAM ID:
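
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2643144
end_bp = 2644364

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1221 406
```

Under these assumptions the computed values match the listed Gene Length (1221 bp) and Protein Length (406 aa).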


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.50523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGCG GGGAGGTTTT CCTGGTCCTG CTTCTGGGAC TGGTGGCGCT GTTCTTCATC 
ATCATGCTGT TCAAGATGGT GCGGATCGTG CCGCAGCAGC AGGAGTACAT CGTCGAACGA
CTGGGCAAGT ACTCCAAGAC CCTGACTCCG GGCCTGAACT TCCTCGTGCC GATCCTGGAC
GCCGTACGGT CCAAAGTCGA CAAACGCGAG CAGGTCGTCA GCTTCCCGCC GCAGCCGGTC
ATCACCTCGG ACAACCTGGT GGTGTCCATC GACACCGTCA TCTACTACAT GGTGACCGAC
TCGGTGCGCG CCACCTACGC GATCTCCAAC TACCTGCAGG GCGTCGAGCA GCTGACCGTC
ACCACGCTGC GCAACGTCGT CGGCTCCATG GACCTGGAGC AGGCACTGAC CAGCCGCGAC
ACCATCAACA GCGCGCTGCG CACCGTCCTG GACGAGGCCA CCGGCCAGTG GGGCATCAAG
GTCACCCGCG TCGAGATCAA GGCCATCGAC CCGCCACCCA GCGTGCGCGA GTCCATGGAG
AAGCAGATGC GCGCCGAGCG TGACAAGCGG GCCGCGATCC TCACCGCCGA GGGTGTCAAG
GCGTCCCAGG TGCTGACCGC CCAGGGTGAG CAGGAGGCCG CGGTGCTTCG CGCCCAGGGT
GACCGGCAGG CCCGCATCCT GCAGGCCGAG GGCCAGTCCA AGGCCATCGA GACCGTGTTC
ACCGCGATCC ACAAGTCCAA CCCGGACGAG AAGCTGCTGG CGTACCAGTA CCTGCAGACG
CTGCCGCAGA TCGCCGCCGG TCAGTCGAAC AAGCTGTGGA TGATCCCGGC CGAACTGACC
CGGGCGCTGG AGTCGTTCAG CGGCGCCGTC GGCGGCCCGA TCAGCTCCGG TCCCGCCTCG
GCGGTGACCA ACGCGGTGTC GCAGGCCCTG TCGTCCGAAC AGGACGGCAC CGGTGAGGCG
GCGCCGAGCC ACACCGACGC CGACCGCTCC GAGACCGCGC CGCGGACCCC GAGCGCCGAG
GACGACGACG CGGCCACCAC CCGCATCGGC AAACCGAAGC CGGGCACCGA CGACACCCAG
GCCCTGGAAG CCGGGCCGAA GGTCTCGCCG AGCCAGTTCA CCGCGACCGA CGCGGCGAAC
ACCATGCTGG ACTCGACCCA GGTGGTGCCG CCGCCGGTGC CGCCGACCAA GCCCGGCGAC
CTGCCGCCCA CGGCGGTCTA G
 
Protein sequence
MSGGEVFLVL LLGLVALFFI IMLFKMVRIV PQQQEYIVER LGKYSKTLTP GLNFLVPILD 
AVRSKVDKRE QVVSFPPQPV ITSDNLVVSI DTVIYYMVTD SVRATYAISN YLQGVEQLTV
TTLRNVVGSM DLEQALTSRD TINSALRTVL DEATGQWGIK VTRVEIKAID PPPSVRESME
KQMRAERDKR AAILTAEGVK ASQVLTAQGE QEAAVLRAQG DRQARILQAE GQSKAIETVF
TAIHKSNPDE KLLAYQYLQT LPQIAAGQSN KLWMIPAELT RALESFSGAV GGPISSGPAS
AVTNAVSQAL SSEQDGTGEA APSHTDADRS ETAPRTPSAE DDDAATTRIG KPKPGTDDTQ
ALEAGPKVSP SQFTATDAAN TMLDSTQVVP PPVPPTKPGD LPPTAV
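
Both sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch for joining such blocks into one string and computing a GC fraction, assuming that whitespace is the only formatting that needs removing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input is only the first row of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as printed (six blocks of ten bases).
first_row = "ATGAGTGGCG GGGAGGTTTT CCTGGTCCTG CTTCTGGGAC TGGTGGCGCT GTTCTTCATC"
seq = clean_sequence(first_row)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content field in the record.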