Gene Snas_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4221 
Symbol 
ID8885422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4516642 
End bp4518222 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content67% 
IMG OID 
ProductTerminase 
Protein accessionYP_003512963 
Protein GI291301685 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.329871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.306119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCC CCCGCGGACT ACCCGAGCGC ACTCTCGGCT GGGAAGTCCT GGCGTGGACG 
GCCGCTTATC TCCACCAGCC CGACGGCCCC TATGCCGGGA ACCCGTGGCG GGCAACGCCT
GAGCAGGTCC GACACGTTTT GTGGTGGTAC GCGATCGACG AGGCTGGGCG GTTCCTGTAC
CGCCGGTCGA TCTTGCGCCG GTCGAAGGGC TGGGGCAAGG ACCCGGTCGC CGCCGTGCTG
TCGCTGGTCG AGCTGTTGGG CCCGTGCCGC TACGGAGGCA CCAACGCCCA AGGCCAGCCG
GTCGCGGTGC CGCATCCGTC CCCGTGGGTG CAGATTGCCG CGACGTCCGA GGCGCAGACG
GTCAACACGA TGTCGCTGAT TCTGTCCATG CTGGAATACG GTTCGCTGGT CGACGACTAT
TCGCTCGACG TCGGCAAGAC CCTCATCTAT ACCCCGCGCG GGCGTCTGCA TGCGGTGACC
TCATCCCCGC GAAGCCTGGA AGGCCCCCGA CCGTCCTATG TGGTTCTCGG GGAGCCTCAG
AACTGGTTGC CGTCCAACGG CGGCCAAGCC ATGTCCGAGG TGATCCGTCG CAACCTCGGC
AAGAGCCGCG ACGGCGCGGC GCGAAGCACC GAGATCGGTA ACGCGCACTT GCCCGGCGAG
GATTCCGTCG CTGAGTCGTC CTATGAGGCG TGGCTTTCGA TGGTCGAGGG CCGCTCGCGT
GACACCGGCA TCCTGTACGA CTCCCGCGAG GCACCGCCCG ACACGGACAT GAGCGACCCC
GAGTCCCTGC GGGCGGGACT TCGCGCCGCC TACGGTGACA GTCATTGGGT TGACCTCGAT
CGGGTCATGG GCGAAATCTG GGACCCGGCC ACCCCGCCCA GCGTCTCGCG GCGTTACTAC
TTGAACCACG TCACCGCCGC TGAGGACGCG TGGTGCGCCG CCCACGAGTG GGACTCGTGC
GAGACCACCG ACCGGATTCA ACCCGGCGAC ACCGTCACTA TTGGATTCGA CGGGTCGGTG
TCGGACGACT CCACCGCCAT CGTGTTGTGC CGCGTCGACG ACGGCCTCGT CGACCTGGCG
GCGGTGTGGG AGAAACCCGA CGGACCGGCG GGCGATGACT GGCGCGTGCC GCGCGACCAG
GTCGACGAGA TGGTCGACCA CCTCATCGCC ACCTATGACG TCGCCGCCGA TTACAGCGAC
GTCGCCTATT GGGAGTCCTA TATCGATACC TGGTCGATCA GGTACGCCGA CGTCGTGCGG
CACAAGGCGA GCCCTAAGTC GCTCTTCGGG TGGGACATGC GAAGCCATGC CAAGGAATTC
GTGTTGCGGG GCGCCGAGGC GACGCTGTCG GCGATCACCG ACGGGACGTT GAAACACACC
GGCAATCCGA TCCTACGTAG ACACGTCCTC AACGCACGAC GCCGACCGAA TCGGTGGGGG
CTGTCCTTCG GCAAGGAATC CCGAACCAGC TCCCGCAAGG TTGACGCCGT AGCCGCGATG
TGCCTTGCCC GCATTGCCCG CGCCGACGTA CTCGCCACCG GTGCCGGACG CCAACGCACC
GGCGAAGTCT GGGCCCTGTA G
 
Protein sequence
MTGPRGLPER TLGWEVLAWT AAYLHQPDGP YAGNPWRATP EQVRHVLWWY AIDEAGRFLY 
RRSILRRSKG WGKDPVAAVL SLVELLGPCR YGGTNAQGQP VAVPHPSPWV QIAATSEAQT
VNTMSLILSM LEYGSLVDDY SLDVGKTLIY TPRGRLHAVT SSPRSLEGPR PSYVVLGEPQ
NWLPSNGGQA MSEVIRRNLG KSRDGAARST EIGNAHLPGE DSVAESSYEA WLSMVEGRSR
DTGILYDSRE APPDTDMSDP ESLRAGLRAA YGDSHWVDLD RVMGEIWDPA TPPSVSRRYY
LNHVTAAEDA WCAAHEWDSC ETTDRIQPGD TVTIGFDGSV SDDSTAIVLC RVDDGLVDLA
AVWEKPDGPA GDDWRVPRDQ VDEMVDHLIA TYDVAADYSD VAYWESYIDT WSIRYADVVR
HKASPKSLFG WDMRSHAKEF VLRGAEATLS AITDGTLKHT GNPILRRHVL NARRRPNRWG
LSFGKESRTS SRKVDAVAAM CLARIARADV LATGAGRQRT GEVWAL