Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3814 |
Symbol | |
ID | 8599260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 4049707 |
End bp | 4050777 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | protein of unknown function DUF541 |
Protein accession | YP_003310579 |
Protein GI | 269122402 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TTATTTTAGT TTTAGCCTTG GGCTTCTCGG TATTCTTAGC CGGTGAGCCT ATAAGAAAGC TGAGTGTTAC AGGAAATGCA GAAAAGGAGG TCATGCCTGA TATAGCAAAA ATAAGCTTCA GAGTATACAC AAAAAATGAG AATCTTAAAA AAGCCGGTCA GGAAAACTCT AAAAATATGG AAAATTTTAA AAATGAGCTA AAAAAGAGAA ATATTCCTGT AACTGCTATA GAAACATATA ATTATTACAC GCAGAAAAGT ACTGAAAGAG ACACAGCGGA AAGTAAGAAG ACAGAATATT ATACAACATT ATATTTTGCA GTAAAGGTAA CAGATCTGAC TAAAATACCG GATCTGATAA GCCTATCGGA AAGCAATAAG ATAAAGAGCT TAAAAAGCGA CAGTCTGGAT AAATCAATAT ATTATGGTGA AATAAACAGA AACAGCTCTC AGAAATCATC AGCAATATCT GATACATTTA AGGTATACGA CAGCATAAAA TCCCAGCTGG CAAGGCTCGG AATAAGCGGC AGCAACATCT CGGTATATTC ATATTCAACA ACTTCACGGG AAGTAACGGA TAACAAATCT GTAAAAAACA AGGAATATCA TAATATATAT AATGACTTTG TGCTGGAGTT GAAAGATATA AGCAGGATAA ATGATGTAAT AAAAATTGCG GAAGATAACA AAATAAGCGT ACAGGGAAAT ATAGCATTCG ATATATCAAA CAAAGACCAG ATAGAGTCGG AGCTGTATAA TGCAGCCTAT GAACAGACAA AAACAAAGGC GGTAAGCATA CTGAAATCAA GTGAAATGAA ACTCGGTGAT CCGCTGGTGG TAAGTGAGAG CATATCTTAT CAGAATCAGG CCATCCAAGA GGATTATAAT TATACTAAGC AGATTAATGC AAAAAATTTA GATGTAAGAG GCGGTATGGA TATGGAGTAT AGGCAAGTAC CGGCAATGAC AGCAACAACA GAAGCCAGAC CCCAAATAGA TTATAAGCCT CAGGTAATGA AATTAACTCA GAATGTAAGC GTACTTTATG AAATAAAATA A
|
Protein sequence | MKKFILVLAL GFSVFLAGEP IRKLSVTGNA EKEVMPDIAK ISFRVYTKNE NLKKAGQENS KNMENFKNEL KKRNIPVTAI ETYNYYTQKS TERDTAESKK TEYYTTLYFA VKVTDLTKIP DLISLSESNK IKSLKSDSLD KSIYYGEINR NSSQKSSAIS DTFKVYDSIK SQLARLGISG SNISVYSYST TSREVTDNKS VKNKEYHNIY NDFVLELKDI SRINDVIKIA EDNKISVQGN IAFDISNKDQ IESELYNAAY EQTKTKAVSI LKSSEMKLGD PLVVSESISY QNQAIQEDYN YTKQINAKNL DVRGGMDMEY RQVPAMTATT EARPQIDYKP QVMKLTQNVS VLYEIK
|
| |