Gene Francci3_2180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2180 
Symbol 
ID3906780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2553665 
End bp2554933 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content61% 
IMG OID637879513 
Producthypothetical protein 
Protein accessionYP_481279 
Protein GI86740879 
COG category[S] Function unknown 
COG ID[COG1479] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.912338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.467765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTAT CCTGTGCGAC AGCTTGGCCA CGTGGGCTGG TCGACTCGGC GAAGGCTCCC 
GCCACCCGCG CTGGACGGTC CCACTTGGCG GCCTGCCCTG AGTTCTTACT GACTCGGCAC
CATGTCGCAC GGCTTTCCGA GGGGGTGACC GAGCTGGCTG AGCAAGCTGG CAACGACAGC
GACTCCGAGG ACCTGCGCGA CGACGACAGG GCCGAGCTTC AGGAGGTGGA CCCCGACCCT
CCGCAGATCA GCTACAGCGG CACCGACTTC GACGTCGAGG GTCTGGTCCG TCGGCACGAT
CGGGGCGACA TAATAGTGCC GTCCTTTGGC AATGATGATC CCGGCATCGA GACAGCCGGC
TTCCAGCGCG AGTTCGTTTG GAAGCGTTCG CAGATGGACC GTTTCATCGA GTCCCTACTG
CTCGGATACC CCATTCCGGG TATATTCCTT GTCCAACAAC AGGACCGCCG CTATCTCGTT
CTTGATGGAC AGCAGCGGAT AAAGACCCTG AGCCTCTTCT ATAATGGCAG CATCAACGGG
CGCGAGTTCG CACTTCAGAA CGTGGCCGCC AGATTCCAGG GGCTGACCTA TCAAACTTTT
TCACCCGAAC AGCGTCGCAC GCTCGACAAT ACCTTCATCC AGGCGACAAT AGTCAAAACC
GACGGCACCC GCGAGTCACT CGACGGCGTT TATCAGATCT TCGAGCGGCT GAACTCGGGC
GGTACGCAGC TCACGCCGCA CGAGATTCGC GTGGCGCTCT ATGCAGGCGA GTTCATCAAG
TTCCTCACCG CTCTGAACGA AAACCCGGCG TGGCGCGCTC TCTACGGGCC GCCATCACCA
CGGCTACGCG ATCAGGAGAT CGTGCTCAGA TTCATCGCCC TCTACGTGTC ACCGGGTAGC
TATAAGCGCC CCCTCAAGAA ATACCTGAAC GATTTTGTTG GCGCTCACCG CCGACTGAAC
GAACTGGACG CCGAGTTGAT CGAAAAACGA TTCGACAGGG CAGCACAGCT TGTGTTGGAG
GAGGCCGGAA GAAGCGCCAT TCGCGGCCGG GGGCGTCAGC TCAATGCGGC TCTCACCGAG
GCGCTTTTGG TAGGATTGGC CCGTAGGCTT GATGCCGGTA GCGAACCGAC CGCAGCTGAG
GTCAGCCGCG CCATCGACGC GCTCCTCAAC GAACCCGACC TGGATTACGT GACCACGCGC
GCAACGGCCG ACGAGGAGAG TGTGCGGATG CGCCTGGCGC TGGCAACGAG AGCTTTCTCC
CGCATCTGA
 
Protein sequence
MVVSCATAWP RGLVDSAKAP ATRAGRSHLA ACPEFLLTRH HVARLSEGVT ELAEQAGNDS 
DSEDLRDDDR AELQEVDPDP PQISYSGTDF DVEGLVRRHD RGDIIVPSFG NDDPGIETAG
FQREFVWKRS QMDRFIESLL LGYPIPGIFL VQQQDRRYLV LDGQQRIKTL SLFYNGSING
REFALQNVAA RFQGLTYQTF SPEQRRTLDN TFIQATIVKT DGTRESLDGV YQIFERLNSG
GTQLTPHEIR VALYAGEFIK FLTALNENPA WRALYGPPSP RLRDQEIVLR FIALYVSPGS
YKRPLKKYLN DFVGAHRRLN ELDAELIEKR FDRAAQLVLE EAGRSAIRGR GRQLNAALTE
ALLVGLARRL DAGSEPTAAE VSRAIDALLN EPDLDYVTTR ATADEESVRM RLALATRAFS
RI