Gene Franean1_4000 details


Gene Information

Locus tag: Franean1_4000
Symbol:
ID: 5672360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4783419
End bp: 4784387
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 73%
IMG OID: 641242878
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001508295
Protein GI: 158315787
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases
TIGRFAM ID:
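
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (both are assumptions that happen to match the numbers in this record; the function names are illustrative only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4783419, 4784387)
print(length, protein_length_aa(length))  # 969 322
```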


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.649548
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGAT CCGAACGGCT GCTCGATCGG CTTGAGCGAG AGCAGAGGCT TGATGCCCCT 
GCGGAGGCAG TCTCGGGCTT CTGGTCGGCG GCGCTGCGCT CCGAACGGTT GCGCGATGTG
CTCAGCGGCC GGCAGCTCGG GCATCCGCTG CATCCCGCGG CGATCCTCGT CCCGGCGGGG
ACGCTGCTGA GCGCGACGAT GCTGGACACC ACTGGTGGCG CCGCCCTCCG CCCGGCCGCG
CGACGGCTCG TCGGACTCGG CCTGCTGTCG GCCGGACCAG CCGCGCTCGC CGGGTGGTCG
GACTGGCTCG ACACGAAGGG CGCCGAGCGC CGAGTCGGGC TCGTCCACGC CGCTTCCAAC
GTGGTGGGAC TCGCAAGCTA CGCGATCTCA TGGCGCCAGC GTCGTCGCGG CGCGCGTGGG
CTGGCCGCCA GTCTCGCGGG TGCGGCCGCC CTCGGCCTCG GCGGCTGGTT GGGTGGTCAC
CTCGCCTACG CGCTGGGGGT GGGGGTGGAC ACCACCGCGT TCCAGCGGGG GCCGGCCGAG
TGGACGGACG TGCTGGCGAC CAGCGAGGTC ACCACGGAAC TGCGGCAGGT CGAGATCGAT
GGGGTGCCGG TGCTGCTGAC CAGGGTCAAC GGTCAGGTCG TCGCGATCAG TGACCGCTGT
ACCCACCGGG GAGGCCCGCT GCACGAGGGC GAGCGCACCG GCGGCTGCGT GCGCTGCCCC
TGGCACGGAA GCCAGTTCGA GTTGGCCTCC GGCGAGGTCG TCCAGGGCCC CGCCACCCGT
CCGCAGCCGG TCTACGAGGT CCGCGAGACC GGCGGGCGGG TCGAACTCCG CCGCTCAGAG
GTCCGGACCC TACGCGCCAA CCCCGTCGGA CCGTCGCCCA GCCCGGACGC ACGGCTCCGG
ATCTCGACCG AAGGCCAGGC GGTGACCAGC GCGGCCATGC ATCGCACGAG CCAGCACGGA
ATATGGTAG
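
The GC content field can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one way to do it, assuming the sequence contains only unambiguous bases; the helper names are hypothetical. The short demonstration uses only the first block of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (assumes no ambiguity codes)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 10-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_fraction("ATGCGCGGAT"))  # 0.6
```

Passing the full gene sequence as one multi-line string gives the gene-wide value to compare with the GC content field.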
 
Protein sequence
MRGSERLLDR LEREQRLDAP AEAVSGFWSA ALRSERLRDV LSGRQLGHPL HPAAILVPAG 
TLLSATMLDT TGGAALRPAA RRLVGLGLLS AGPAALAGWS DWLDTKGAER RVGLVHAASN
VVGLASYAIS WRQRRRGARG LAASLAGAAA LGLGGWLGGH LAYALGVGVD TTAFQRGPAE
WTDVLATSEV TTELRQVEID GVPVLLTRVN GQVVAISDRC THRGGPLHEG ERTGGCVRCP
WHGSQFELAS GEVVQGPATR PQPVYEVRET GGRVELRRSE VRTLRANPVG PSPSPDARLR
ISTEGQAVTS AAMHRTSQHG IW
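
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; this gene starts with ATG, so no alternative start handling is included. The function name and the use of "X" for unrecognized codons are illustrative choices.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unrecognized codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCGCGGAT CCGAACGGCT GCTCGATCGG"))  # MRGSERLLDR
```

The last nine bases of the gene, ATATGGTAG, translate to IW followed by a stop codon, which matches the end of the protein sequence.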