Gene Tery_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1550 
Symbol 
ID4242029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2362918 
End bp2364054 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content40% 
IMG OID638106693 
ProductPRC-barrel 
Protein accessionYP_721303 
Protein GI113475242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.432457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCTG AAAAAAACCG CCAACGCTCT GAACTGTTAG GAACTCAAAT AATTACTCGT 
GATAAAGGTA AACGCCTAGG AGTAGTAAGT CAATTATGGG TAGATGTTGA TAAACGGGAA
GTAGTAGCTA TTGGGCTACG GGACAATATA TTGGCAGTTG CTGGAATACC TAAGTTTATG
TTTCTCAAGG ACGTCTGTGA AATAGGTGAT GTGATATTAG TAGATGACGA AGAAGTAATA
GAGGAAGATA TTGATACTGA AGCTTATAGT GGGTTGATAA ATAGTGAAGT TTTGACAGAA
AATGGTGATT TTTTAGGCAG GGTTCGAGAT TTCAAGTGTG ATGTAAGAGA TGGTAAAGTT
TTGTCATTGA TTATCGCTTC CATTGGTATA CCCCAAATTC CAGACCAAAT TATCAGCACT
TATGAAATGC CTATTGATGA AATTGTTGCT AGTGGGCCCA ATCGTCTAAT TGTTTTTGAA
GGTTCTGAAG AAAAACTACA ACAGTTAACA GTAGGAGTAT TGGAACGTCT GGGTCTAGCA
GAAGCACCTT GGCAAAAAGA GGAAGAAGGT TTATATAACC CTCCTACTGT AAATCCTGAT
AACCAATTAG GTCCAGGACA GTTAGTGTCT CGTGAACCTA TTCGTACAGC CAGACGCTCT
ACTCAGGAGA CATGGGATGA TAACTGGGTT GAACCAGAGC CAGTAGAAAG TAGGGTAATT
GAACCAGAAC CAGTTTATCG TAAGTATTAT GAACAAGAAA TGGCTCCTCC TCCCCGGCAG
CGAGAACAAG AAGCGGTTTA TGATAATTAT TATGAGCCAG AGCCTGTTGC TCCCCCATCA
GCGCGTCAGG TAAGGGTAGA ACCTGCTGAT AATGACTACT ATGAGGAAGA AGGTAATTGG
GGTGACTCAG AGACTGGATA TGATAATGAA CAGGATAGAT ATCAGAAGAA AGAGGATGGA
TATCAGAAGA AAGAGGATGG ATATCAGAAG AAAGAATATA AAGCAACTTC GGAGTATGAA
TATGATGAGG AGATAGATAG AGATGCTTGG GCTGATGATG AGGCTCCAAA ACCTTATCAG
GCTCCACGGG TGAATATTCC TGAAAAAACT AAGGTCCCAG AATATGAAGA TTATTAG
 
Protein sequence
MRSEKNRQRS ELLGTQIITR DKGKRLGVVS QLWVDVDKRE VVAIGLRDNI LAVAGIPKFM 
FLKDVCEIGD VILVDDEEVI EEDIDTEAYS GLINSEVLTE NGDFLGRVRD FKCDVRDGKV
LSLIIASIGI PQIPDQIIST YEMPIDEIVA SGPNRLIVFE GSEEKLQQLT VGVLERLGLA
EAPWQKEEEG LYNPPTVNPD NQLGPGQLVS REPIRTARRS TQETWDDNWV EPEPVESRVI
EPEPVYRKYY EQEMAPPPRQ REQEAVYDNY YEPEPVAPPS ARQVRVEPAD NDYYEEEGNW
GDSETGYDNE QDRYQKKEDG YQKKEDGYQK KEYKATSEYE YDEEIDRDAW ADDEAPKPYQ
APRVNIPEKT KVPEYEDY