Gene Francci3_2140 details


Gene Information

Locus tag: Francci3_2140
Symbol:
ID: 3905530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2508770
End bp: 2510017
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 75%
IMG OID: 637879475
Product: hypothetical protein
Protein accession: YP_481241
Protein GI: 86740841
COG category: [T] Signal transduction mechanisms
COG ID: [COG0589] Universal stress protein UspA and related nucleotide-binding proteins
TIGRFAM ID:
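
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 2508770
end_bp = 2510017

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1248 415
```

Under these assumptions the result agrees with the listed gene length (1248 bp) and protein length (415 aa).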


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.283428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGACA TCGTGGTCGG GATCGACGGG TCGACAGATG CCGAGCGCGC GCTGCGGTGG 
GCCGTGCGCG CGGGGCGTCG TTGTAGCGCG CGGGTGCGTG CCGTGCTCGT CTGGGCCGCG
GCCGGCCCGT TCTGCACGCG GGTGAAGCCG CCGCCCGCCA CCTCACCGGA GCATCCCCGC
TGGACGGCCC AGTGGATGCT GCACGACGTC GTCAACCGGG TCCGCGAGGC CCAGCCCGAC
GCCTGGATCG TCGAGCGGAC CGTGTACGGG AGCCCGGTGG ACACGATACT GACCGAGTCG
GAGGGCGCGG TCATGCTCGT CCTCGGTGCC CGCGGCCGTG ACCGGCCGCG CCGCCTGCCG
GCCGGTTCGG TCAGCATGGC GTGCGTGTAC GGGGCGTCGG TGCCGGTGGT CGTGGTCCAC
GGCAGGTCGG CGGAGCCGGC CGAGAGCGGG CCCGTCGTCG TCGGGGTGGA CGGGTCCGCC
TGCTCGTTGG CCGCGCTGCG GTGGGCCGCC CAGGAGGCGG CGCTGCGCCG GGCGTCGCTG
CGCATCATGC ACGCCTGGGC GCCGCTGCCA CCGGAGTGCG TTGTCCGGCT GGGACTCTGC
GGTGCCAGGT CCGCTGGTGC GGTACCCACG GTGGGCGCTG GGGCCGTTCG TGCCGGGCCG
GCTGGTGCGG CGCCGGCTGG TGCGGCGTCG GCTGGTGCGG GGCTCGGCGA CACGGATCCG
GGCGGGACCG GATCCATCCG GACGGACCCG GGCGAAACGG GCTTCGCTCC GGGATTCGCC
GGCCGTGGAG TTCGGGGCAC GGCATCGGGG GGCACGGTAC TCGATGATCG TCCAGGATTT
GGTGACCCGG GGTCCGCCGA TGCGGCGCTC GGCGGGTCGG CGTTCGGCGG GTCGGCGTTC
GGGGGTCCCG CGTTCGGGGG TCCCGCGCTC GATGCGACGG GATCCGGCGG AACGGCCCTC
GTCGACGCCG CCCTCGAAGG CGCCGCGCGA GCCGTGCTGG ACGAGGCGGT GCGCGTGGGT
CTGCCCGAGC CGGGTGATCT CGATGTGCGC ACGGACCTGA TCCGCGGAGC GGCCGCGTCC
AGCCTGCTGC GGGCCGCGGC GGGGGCGCAG CTCCTGGTCA TCGGAGCCCG GGGCCGCGGC
GGATTCGCCG AGCTGCTGCT GGGCTCCATC AGTTCACAGT GTGTGACACA CGCCCAGTGC
GCGGTCGCGG TCATCCGCAC GCCCGGAAGT ACGACGTGTA GTGATTAG
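
The gene sequence is shown in space-separated blocks of ten bases. A minimal sketch of how such a listing could be cleaned up and its GC fraction computed follows. The helper names `clean` and `gc_fraction` are hypothetical, and the sketch does not assume it reproduces the exact method used to derive the GC content field in this record.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "GTGGCGGACA TCGTGGTCGG GATCGACGGG TCGACAGATG CCGAGCGCGC GCTGCGGTGG"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full sequence text instead of the first line would give the value for the whole gene.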
 
Protein sequence
MADIVVGIDG STDAERALRW AVRAGRRCSA RVRAVLVWAA AGPFCTRVKP PPATSPEHPR 
WTAQWMLHDV VNRVREAQPD AWIVERTVYG SPVDTILTES EGAVMLVLGA RGRDRPRRLP
AGSVSMACVY GASVPVVVVH GRSAEPAESG PVVVGVDGSA CSLAALRWAA QEAALRRASL
RIMHAWAPLP PECVVRLGLC GARSAGAVPT VGAGAVRAGP AGAAPAGAAS AGAGLGDTDP
GGTGSIRTDP GETGFAPGFA GRGVRGTASG GTVLDDRPGF GDPGSADAAL GGSAFGGSAF
GGPAFGGPAL DATGSGGTAL VDAALEGAAR AVLDEAVRVG LPEPGDLDVR TDLIRGAAAS
SLLRAAAGAQ LLVIGARGRG GFAELLLGSI SSQCVTHAQC AVAVIRTPGS TTCSD
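
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first line of the gene sequence. It makes two simplifying assumptions: the codon-to-amino-acid assignments are those of the standard code (which table 11 shares), and the first codon is always read as methionine, which covers the GTG start codon seen here. The function and constant names are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (same amino-acid mapping as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        # Simplification: the initiating codon is read as Met even when it is GTG.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGCGGACA TCGTGGTCGG GATCGACGGG TCGACAGATG CCGAGCGCGC GCTGCGGTGG"
print(translate_cds(first_line))  # prints: MADIVVGIDGSTDAERALRW
```

The output matches the first twenty residues of the protein sequence listed above.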