Gene SeD_A1285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1285 
Symbol 
ID6875530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1269955 
End bp1271472 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content45% 
IMG OID642784456 
Productphase-1 flagellin 
Protein accessionYP_002215129 
Protein GI198242796 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.540983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.0231264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC AAACAGCCTG TCGCTGTTGA CCCAGAATAA CCTGAACAAA 
TCTCAGTCCT CACTGAGTTC CGCTATTGAG CGTCTGTCCT CTGGTCTGCG TATCAACAGC
GCGAAAGACG ATGCGGCAGG CCAGGCGATT GCTAACCGCT TCACTTCTAA TATCAAAGGT
CTGACTCAGG CTTCCCGTAA CGCTAACGAC GGCATTTCTA TTGCGCAGAC CACTGAAGGT
GCGCTGAATG AAATCAACAA CAACCTGCAG CGTGTGCGTG AGTTGTCTGT TCAGGCCACT
AACGGGACTA ACTCTGATTC CGATCTGAAA TCTATCCAGG ATGAAATTCA GCAACGTCTG
GAAGAAATCG ATCGCGTTTC TAATCAGACT CAATTTAACG GTGTTAAAGT CCTGTCTCAG
GACAACCAGA TGAAAATCCA GGTTGGTGCT AACGATGGTG AAACCATTAC CATCGATCTG
CAAAAAATTG ATGTGAAAAG CCTTGGCCTT GATGGGTTCA ATGTTAATGG GCCAAAAGAA
GCGACAGTGG GTGATCTGAA ATCCAGCTTC AAGAATGTTA CGGGTTACGA CACCTATGCA
GCGGGTGCCG ATAAATATCG TGTAGATATT AATTCCGGTG CTGTAGTGAC TGATGCAGTA
GCACCGGATA AAGTATATGT AAATGCAGCA AACGGTCAGT TAACAACTGA CGATGCGGAA
AATAACACTG CGGTTGATCT CTTTAAGACC ACTAAATCTA CTGCTGGTAC CGCTGAAGCC
AAAGCGATAG CTGGTGCCAT TAAAGGTGGT AAGGAAGGAG ATACCTTTGA TTATAAAGGC
GTGACTTTTA CTATTGATAC AAAAACTGGT GATGACGGTA ATGGTAAGGT TTCTACTACC
ATCAATGGTG AAAAAGTTAC GTTAACTGTC GCTGATATTG CCATTGGCGC GGCGGATGTT
AATGCTGCTA CCTTACAATC AAGCAAAAAT GTTTATACAT CTGTAGTGAA CGGTCAGTTT
ACTTTTGATG ATAAAACCAA AAACGAGAGT GCGAAACTTT CTGATTTGGA AGCAAACAAT
GCTGTTAAGG GCGAAAGTAA AATTACAGTA AATGGGGCTG AATATACTGC TAACGCCACG
GGTGATAAGA TCACCTTAGC TGGCAAAACC ATGTTTATTG ATAAAACAGC TTCTGGCGTA
AGTACATTAA TCAATGAAGA CGCTGCCGCA GCCAAGAAAA GTACCGCTAA CCCACTGGCT
TCAATTGATT CTGCATTGTC AAAAGTGGAC GCAGTTCGTT CTTCTCTGGG GGCAATTCAA
AACCGTTTTG ATTCAGCCAT TACCAACCTT GGCAATACGG TAACCAATCT GAACTCCGCG
CGTAGCCGTA TCGAAGATGC TGACTATGCA ACGGAAGTTT CTAATATGTC TAAAGCGCAG
ATTCTGCAGC AGGCTGGTAC TTCCGTTCTG GCGCAGGCTA ACCAGGTTCC GCAAAACGTC
CTCTCTTTAC TGCGTTAA
 
Protein sequence
MAQVINTNSL SLLTQNNLNK SQSSLSSAIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG 
LTQASRNAND GISIAQTTEG ALNEINNNLQ RVRELSVQAT NGTNSDSDLK SIQDEIQQRL
EEIDRVSNQT QFNGVKVLSQ DNQMKIQVGA NDGETITIDL QKIDVKSLGL DGFNVNGPKE
ATVGDLKSSF KNVTGYDTYA AGADKYRVDI NSGAVVTDAV APDKVYVNAA NGQLTTDDAE
NNTAVDLFKT TKSTAGTAEA KAIAGAIKGG KEGDTFDYKG VTFTIDTKTG DDGNGKVSTT
INGEKVTLTV ADIAIGAADV NAATLQSSKN VYTSVVNGQF TFDDKTKNES AKLSDLEANN
AVKGESKITV NGAEYTANAT GDKITLAGKT MFIDKTASGV STLINEDAAA AKKSTANPLA
SIDSALSKVD AVRSSLGAIQ NRFDSAITNL GNTVTNLNSA RSRIEDADYA TEVSNMSKAQ
ILQQAGTSVL AQANQVPQNV LSLLR