Gene Francci3_4524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4524 
Symbol 
ID3907501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5397679 
End bp5398836 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content69% 
IMG OID637881857 
Producthypothetical protein 
Protein accessionYP_483599 
Protein GI86743199 
COG category[S] Function unknown 
COG ID[COG5542] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.446454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATGATC CGATCGGGCC GGGGATGGTC CGAGCGGCCT GGGCCCGGCT GCCGGAGCCG 
GTCCGGGCCA CGCTGCCGAT CTGGTTCGGG AGCCGGGTCG CGGTCGCCCT GCTGGCGCTC
GCCGCCGCGC GGGTCCTTAC CTCGGCGCCG GCGCGAAACG CTCCCGGCCT GCGCACGCTC
TGGGACCGGT GGGATGTCGG CCTGTTCACG AAGGTCGCCC GGTACGGTTA TCTCTCCCCC
GCCTACTCGG ACCGGACAGA AGTCGACTTT CCCGGCCTCC CGATCGCGAT GCGGATCGTC
CATATTGTCG TTCCGGACTG GATTGTCGCG GGTCTGGTCG TGTCGTTCCT CGCCGGTGCG
GTCGCGGTCG CCGCGCTGTG GCGGTTGGCC GCGGACGAGG TCGGCAAGTC CGCGGCGCGG
TTCGCCGTCC TCAGCCTCAT CTGCTTCCCC TACGCGGTGT TCCTGTTCGC CGCCTATTCG
GAGGGCATCT TCCTGGGGTT CGCGACGGCG TCCTGGCTGG CGGCACGTCG CAGCCGCTGG
TGGTTGGCCG GCCTGCTGGG CGCCGGGGCG GCATCCGCCA GGATCAGTGG TATTCCCTTC
GGGGTCGCGT TGGCCGTGCA GTACGTCGCG AGCCGGCGCT CCGCCGGCCT CCCGGTGTTC
GCCCGGCCGG CGTTGTCGCT GGGGCTTCCA CCCATCCCCG TCCTCGCCTA CCTGGGCTAT
CTCAGGATTC GCACCGGGGG CTGGGGCTCC TACACGGACG CGATGCGGGA CGGCTGGCAC
CGAGGCCTCG ACTGGCCCTG GTCGGGCTGG ACCGCCACCT GGGCATCGGC CACGGATGGC
AACGGTGCGT CGACCTTCGT CTGGTTCTGG CGCGGGGAAC TGCTCGCGGT GGTCGTCGGG
GTACTGCTGA CCATCATCCT GCTGGCCGGG AGACGGTGGG GTGAGGCGAC CTTCCTCGGC
GTGATGACCG TGATAATGGC CTGCACGAAT TATTATGCAT CGGGAATCCG TGGAATTTTA
GTGGCCTTTC CGTTGTATCT GATGCTCGCG CGAGCCGCCG CGCGGCAGAG CTGGGTACAG
TCGGTATATC TATGTACCGG TACGCCGATC ATGGCAGCCC TGGTCGTCGC CTTCACGCAG
GGACAGTGGG TGGACTAG
 
Protein sequence
MHDPIGPGMV RAAWARLPEP VRATLPIWFG SRVAVALLAL AAARVLTSAP ARNAPGLRTL 
WDRWDVGLFT KVARYGYLSP AYSDRTEVDF PGLPIAMRIV HIVVPDWIVA GLVVSFLAGA
VAVAALWRLA ADEVGKSAAR FAVLSLICFP YAVFLFAAYS EGIFLGFATA SWLAARRSRW
WLAGLLGAGA ASARISGIPF GVALAVQYVA SRRSAGLPVF ARPALSLGLP PIPVLAYLGY
LRIRTGGWGS YTDAMRDGWH RGLDWPWSGW TATWASATDG NGASTFVWFW RGELLAVVVG
VLLTIILLAG RRWGEATFLG VMTVIMACTN YYASGIRGIL VAFPLYLMLA RAAARQSWVQ
SVYLCTGTPI MAALVVAFTQ GQWVD