Gene Franean1_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2078 
Symbol 
ID5670479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2503487 
End bp2504542 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content73% 
IMG OID641241000 
Productcytochrome oxidase assembly 
Protein accessionYP_001506421 
Protein GI158313913 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1612] Uncharacterized protein required for cytochrome oxidase assembly 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.7047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGCGC GCACCCCTTG CCACGGACTC GGGCGGATTA CTATGACCGC GATGCCAACT 
TCTACCGATC GTCGGAGCAA GCCCCGCGAG AGCGGCGGCG CCAGCCAGCC GGACACCACC
ACCGGCCGGC TGCCGGTGAT CGGGCTGCGC GCCTTCCGCC GGCTGACGCT GGCCAGCGTC
CTCCTGCTCG CCGCGATCGT GGTGACCGGC GGCGCCGTCC GGCTGACCGG CTCCGGGCTG
GGCTGCCCCA CCTGGCCGCA GTGCGGCGAC GGCTCCTTCA CCCCGCACTC GGCGTACGCC
CTCAACGGCG CCATCGAGTT CGGCAACCGG GTCATCAGCA TCGTCGTCGG CCTGGTCGTG
CTGGCCCTGC CGCTCGCGGC CCGGCGGCTG CGCGAGCCGC GCCGGGACCT GCTGCTGCTC
TCCCTCGGCC TGTGGCTCGG CTTCGTCGGC CAGGCAGTGC TCGGCGGGAT CACCGTGCTG
GTGAAGCTGC ACCCGGCCAC CGTCGCCGCG CACTTCCTGC TGTCGATGGT CCTGCTGTTC
AACGCCGTCG CGCTGCACCG ACGAGCCCGG CAGGCGGCCG GGCCGACTCC GCACGCCGTC
CGCCCGGAGC TGCTCTGGCT CGCCAGGCTG CTGATGACCG TGGCCGGCGG CGTGCTCGTC
CTCGGCACCG TCGTGACCGG CACCGGGCCG CACAGCGGCG ACAGCGAGGA CACCAAGCGG
TTCGGCTTCG ACATCGTCAA CGTCGCCCAG CTCCACGCCG ACGGCGCGAT GATCCTCACC
GGCCTCACGG TTGCGATGAT CTTCGCCGTC CGGCTGGCAT CCGCCCCGGC GGAGGCCAGC
CGCAGCGCCA ACGCGCTCGC GCTGACCGTC GTCGCCCAGG CCGCGATCGG CTTCACCCAG
TACTTCGCCG GCATCCCGCC GCTGCTCGTC GCCCTCCACA TGGCCGGCGC GACCATCATG
TGGATCGTCA CCGTCCAGCT CTGGCTCGCC ATGAGCGAAC GCCCCCCGGC CGGCGAGAAC
GCCTGGACGG GCTCCCGCCA ACTCGCCGCC GGTTGA
 
Protein sequence
MAARTPCHGL GRITMTAMPT STDRRSKPRE SGGASQPDTT TGRLPVIGLR AFRRLTLASV 
LLLAAIVVTG GAVRLTGSGL GCPTWPQCGD GSFTPHSAYA LNGAIEFGNR VISIVVGLVV
LALPLAARRL REPRRDLLLL SLGLWLGFVG QAVLGGITVL VKLHPATVAA HFLLSMVLLF
NAVALHRRAR QAAGPTPHAV RPELLWLARL LMTVAGGVLV LGTVVTGTGP HSGDSEDTKR
FGFDIVNVAQ LHADGAMILT GLTVAMIFAV RLASAPAEAS RSANALALTV VAQAAIGFTQ
YFAGIPPLLV ALHMAGATIM WIVTVQLWLA MSERPPAGEN AWTGSRQLAA G