Gene Information |
Locus tag | Rcas_3841 |
Symbol | |
ID | 5541345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5020708 |
End bp | 5021658 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640895951 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001433896 |
Protein GI | 156743767 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
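The coordinate and length fields above agree with one another, and a short sketch can confirm this. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the reported 951 bp) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(5020708, 5021658)
print(length, protein_length(length))  # 951 316
```

The strand field does not affect this check, since the length depends only on the two coordinates.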
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00994535 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.14386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCT ATCTGCTGCA ACGCCTGGCG CTGGTCATTC CAACGCTGGC AGGAGTGTCA CTCATTATTT TCGCCCTGAT GCGCCTCTTG CCCGGGGATG TGGTCGATAT TCTCTTTGGC GGCGATACGC AGGCCGATCA ACGCACGCTC GATCAGATTC GTGAGAATCT GGGTCTCAAT CGCCCGCTGG CGGTGCAGTA TCTGGAGTGG ATCGGCGGGT TTCTGAGCGG CAATTTTGGC GTTTCGATGC GCACGGGCAT CCCTGTGGCA GAAAGTATCG CCCAGGGGAT GCCGGTCACA CTGCAATTGG CGGTGATGGC GATCTTCTTT GCGTGCCTGT TTGCCATTCC GCTGGGGATC ATCGCTGCGG TGCGCCGTAA CGGGGTCACC GATATGCTGA CGCGCATCGT CGGGTTGATC GGTCTTTCTT TCCCCGCTTT CTGGCTGGCA ACGATGTTTC TGCTGATCAG TTCGACGATG TTCCGCTGGA CGCCGCCGCT GGGCTGGGTA TCGCCCTTCG CCGATTTTGG GCGCAACATG CAGATGATGC TGGCGCCGGC GCTCCTGCTG GCGCTCCAAC CAATGGCGAT CATTATGCGT ATGACCCGCG CATCGTTGCT GGAAGTGCTG CGCCAGGATT ATATTCGGAC AGCCTACGCC AAGGGATTGC GTGATCGCGC AGTTCTGCTG CGCCATGCGC TCCAAAATGC GTTCATTCCG GTCCTGACCG TCATCGGCGT CCAGTTTGGG GTGTTGATGG GAGGTTCGAT CATTATCGAG CAGATTTTTT CGCTGCCCGG CATTGCATTT CTGTTGATCA ACGGGATTTA CAACCGTGAT TATCCGGTTG TCCAGAGTAC GGTGCTGCTC CTTTCGTTGA TCTTCGTTCT TGTCAATCTG GCGGTCGATC TTCTTTACAG CGCCGTCGAT CCACGGATCC GTTATGACTA G
|
Protein sequence | MNRYLLQRLA LVIPTLAGVS LIIFALMRLL PGDVVDILFG GDTQADQRTL DQIRENLGLN RPLAVQYLEW IGGFLSGNFG VSMRTGIPVA ESIAQGMPVT LQLAVMAIFF ACLFAIPLGI IAAVRRNGVT DMLTRIVGLI GLSFPAFWLA TMFLLISSTM FRWTPPLGWV SPFADFGRNM QMMLAPALLL ALQPMAIIMR MTRASLLEVL RQDYIRTAYA KGLRDRAVLL RHALQNAFIP VLTVIGVQFG VLMGGSIIIE QIFSLPGIAF LLINGIYNRD YPVVQSTVLL LSLIFVLVNL AVDLLYSAVD PRIRYD
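The gene and protein sequences above can be cross-checked with a minimal sketch that strips the 10-base spacing, computes the GC fraction, and translates codon by codon. It assumes the sequence is given 5' to 3' on the coding strand and starts at the first codon. The codon-to-residue mapping used here is the standard one, which translation table 11 shares for internal codons; alternative start codons are not handled, which is harmless for a gene beginning with ATG. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalize case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACCGCT ATCTGCTGCA ACGCCTGGCG"))  # MNRYLLQRLA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and GC content.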