Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4110 |
Symbol | |
ID | 4245624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6340300 |
End bp | 6342288 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109011 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_723591 |
Protein GI | 113477530 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components [COG4176] ABC-type proline/glycine betaine transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.580114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTATTT TGAATTATTT ATCTTATTAC TTGATCACTG AAACTCAAGG AGGATTAAAC AATTTATTGA ACCCTTTCGA GTCTTATACT TTACCCCTGG AGGAATGGGT TAACAATTTA GTTGATTTTC TGGTTGACAA CTTTCGGTTT ATATTCCAAG CTATCAGCTT ACCTATTAGC GGAACTCTAA GAATAGTAGA ATTGACTTTT CTAGCAATTC CGCCTTTAAT TTTTCTCATT ATCACTGGTT TGCTGGTCTG GCAATTAGCA GGTAGAAAAA TAGCTATATA CAGTTTTATT TCTCTTTGTA TTATTGGTTT TTGTGGTGCT TGGGAACAAT CAATGGTTTC CCTATCTTTA ATACTTACTG CTGTAATATT CTGTATGCTT ATTGGTATAC CTCTAGGTAT TGCTTGTGCC CTGAGCGATC GCTTTAATAA AATCTTGCGA CCTCTCCTAG ATGGGATGCA AACTCTTCCA ACTTTTGTTT ATCTGGTACC TGTAGTAATG CTATTTGGTA TTGGCGAAGT TTCTGGTATA ATAGCGACTT TTGTTTTTGC AGTTCCTCCT CTAATTCGCC TGACCAACCT GGGCATTAGA CAGGTGTCCC CAGAAGCGGT AGAGGCTGCA CTTGCGTTTG GTTCAACTCG CCAACAAGTT TTGTTGGAAG TGCAAATACC TCTAGCTTTA CCTACTATAC TTGCTGGTAC GAACCAAGCC ATTTTATTAG CCTTATCGAT GTCAGTTGTC ACCTCAATGA TTGGAGTAGA GGGATTAGGA CAAATGGTAT TACAAGGTTT AGGACGTTTG AATGTGGGAC TAGCGGCAAT AGGGGGCTTA GGTATTGTAT TAATTGCTAT AATGTTAGAC CGTATTACTC AAGGTGTTAG TCAAGGTAAT ATTCAGTCTT GGCAAGACCG TGGTCCTATC GGTTGGTGGC GGTCTCGTAA GTTTTTCAAA CATCATAAAA CAACTTTAGG GATAAGTATT CTGCTTGCCT TGTTAGTTGG TGTGACAGGT TGGCAATTAA TGTCTCAAAA AAATATCAAG TCAACTGAGC ATCTACTGCC AGGGGAAGGA GTAGTGGTGC GTCCAACTTC TGGCTTAGAA ACCTACGGTA TATTCACCAC AGAAATTGTG AATATTGGAC TGGAAAAATT AGGATATGAG ATAAGAGGTG AAAAGCAACT GAATGTTCCA GCAATGCATC TGGCTGTTAG CAATGGAGAT TTAGATTTTG CTGGAACCCA TTGGGAAGCT AGCCATCAAG AATTTTTTGA CAATAATGGT GGGGAAGAAA AATTAGAACG GCTAGGAACT CTGATTTCTA ATTCTACTAT GGGGTATCAG ATAGATAAAA AAACAGCAGA TAAGTACAAT ATTACTAATA TATCACAACT TAAAGAACCA AAAATTGCTA AACTTTTTGA CTCTGATGGT GATGGTAAGG CAAATTTAAT TGGTTGTACT GCTGGTTGGT CGTGCGAAAG AGTCATAAAT CATCATTTAG AGGTTTATGG ACTTGAAGAT ACTGTTGAGC AAATTCAAGG AAATTATTCT TCTTTGTTGG CAGATATTTT AGTTCGTTAT CGTCAAGGAA AACCAATTTT ATTTTTTGCT TGGAAACCCC ATTGGTTTTC TTCTATTTTA AGAGAAGGAG AAAATGTAGA ATGGTTGAGT GTTCCTTTTA CATCTTTGGT AGGAACTATG GAAAACTTCA CGGAAAAAGA TACTTTATTT AATGATAAAA ATATCGGTTT TCCTATTGAT AATGTCAGGA TATTAGCTAA TAAGAAATTC TTGAAGGCTA ACCCAGTTGC TAAACGTTTA TTTGAACAGA TTGAGATTCC TTTAGAGGAT GTTAGTATTG AACAGGAAAA AGTAAAAAAT GGAGAAAATA AACCTATTGA TATTCGCCAT CATGCTGAGG AATGGATAGT TAATCATCAA GGGTTATTTG ATAGTTGGTT AGAAATAGCA AAAAGTTAA
|
Protein sequence | MFILNYLSYY LITETQGGLN NLLNPFESYT LPLEEWVNNL VDFLVDNFRF IFQAISLPIS GTLRIVELTF LAIPPLIFLI ITGLLVWQLA GRKIAIYSFI SLCIIGFCGA WEQSMVSLSL ILTAVIFCML IGIPLGIACA LSDRFNKILR PLLDGMQTLP TFVYLVPVVM LFGIGEVSGI IATFVFAVPP LIRLTNLGIR QVSPEAVEAA LAFGSTRQQV LLEVQIPLAL PTILAGTNQA ILLALSMSVV TSMIGVEGLG QMVLQGLGRL NVGLAAIGGL GIVLIAIMLD RITQGVSQGN IQSWQDRGPI GWWRSRKFFK HHKTTLGISI LLALLVGVTG WQLMSQKNIK STEHLLPGEG VVVRPTSGLE TYGIFTTEIV NIGLEKLGYE IRGEKQLNVP AMHLAVSNGD LDFAGTHWEA SHQEFFDNNG GEEKLERLGT LISNSTMGYQ IDKKTADKYN ITNISQLKEP KIAKLFDSDG DGKANLIGCT AGWSCERVIN HHLEVYGLED TVEQIQGNYS SLLADILVRY RQGKPILFFA WKPHWFSSIL REGENVEWLS VPFTSLVGTM ENFTEKDTLF NDKNIGFPID NVRILANKKF LKANPVAKRL FEQIEIPLED VSIEQEKVKN GENKPIDIRH HAEEWIVNHQ GLFDSWLEIA KS
|
| |