Gene Francci3_0638 details


Gene Information

Locus tag: Francci3_0638
Symbol:
ID: 3903316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 722278
End bp: 723495
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 74%
IMG OID: 637877971
Product: hypothetical protein
Protein accession: YP_479751
Protein GI: 86739351
COG category:
COG ID:
TIGRFAM ID:
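
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither convention is stated in the record). The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 722278
END_BP = 723495
REPORTED_GENE_LENGTH = 1218      # bp
REPORTED_PROTEIN_LENGTH = 405    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene == REPORTED_GENE_LENGTH)        # True
print(computed_protein == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions the reported values are self-consistent: 723495 - 722278 + 1 = 1218 bp, and 1218 / 3 - 1 = 405 aa.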


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.594729
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTCG ACGAGCACGG CCGTTGGGTC AGCGACGACG GTGCCTACGT CTGGGACGAG 
GCTGCCCAGA CCTGGCAGCC GTCATCGGCA GCGCCGCCGG CTGGTTCGTC GGCCTCGGCG
CGATTCGGCA ACCATCCCGG TGGGCCCGCC TTCGGTGCCG GTCCGGCCGG GGAGCCCGGG
TCCGGTCGGG TCGGGTCCGA GGTAGGAGCG CAGGGCGGTT CGTTTGGTGG GCGGCCCGGT
GCTGCCGGCT CGTCGCCCGG TGCCGGACCC GGTGCCGGAC CCGGTGCCGT GGAAGCGGCT
CCGGCACCGC CGAGTTGGGG CGGGGCGCTC ACCGATCCAC CCCGCACCAT GGGCGGGGTG
GTGGCCACGC CGTCAGAGCC GGACGCGCGG GGGCTGGCAC CCTACGGCCC CACCGGCCAG
CGCGGCCGGT GGGGGGAACC GGCCGCGGAC GTCATTGCAC CGCACGGGCC GTCGGAGCGG
GGCGATCTGA CCGGTCCGGC CCGCCGCGCG GGTGCGGCGG ATCCCGTCGA TCCGGTGGCT
GCGTCCGCCG GGCCGGCCGC GGCGCAGACG ACCTGGGGCG ACGGCGACTC GACCGGCGAG
ATCCGGCGTG TCGGGGGCTT AGCGGCGACG TACGCCCCGT CCGCCGGCGG TGCGGCGGAC
GGCACATCGC GGTGGGACGA CGATCCGGAC GACGAGCCGG GGCCGTACGC CGGGTCGTTC
GACGAGCACG ACAGTGGATG GGCACCGTCC GGGCCCATCT CCCGGCGCGG GGCGACGGCT
CGGCGTGAAT CGACGGCTCG GCGCGGGGCG ACGGCCCGGC GCGACGAGGC CGGCGGGCTG
CCGGCACGGG TCACCGCGTT CGTCCAGCAC GTGCGCGACC GTCCGCCGTT GCTGATCGGC
GCCGCCGTCG TTCTCGTCTG CCTCGGGCTC GGCGTCATCG GATTCCTCGC CCTCGGCGGC
GGTGGATCGG ACTCCGGGAC CGCGGCCGGC CCGGCCGCGG CGGAGAAGGG CCGTTACTCC
CCGGAGGTCC GTCAGGCATA TCTCAGCTCG TGCCTCGACG TCAGCAACGG TAACGAAGGC
TATTGCACCT GCACGCTGGA GAAGTTGGAA GCCGGCTACA CCCAGGAGGA GTACCAGCGG
TTCAGTGACA ACGTCCAGTC CGAGTCGTCG CAGCGCATCG TGCGGGAGAT CTATGCCGCC
TGCCGTGACA AGCGATGA
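
The GC content reported for this gene can be recomputed from the sequence text. The sketch below shows one way to do that, assuming the sequence is pasted exactly as laid out here (blocks of ten bases separated by spaces and line breaks). The helper names `clean_sequence` and `gc_content` are hypothetical, and the example input is only the first two blocks of the gene, not the full sequence.

```python
def clean_sequence(raw):
    """Strip the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw):
    """Fraction of bases that are G or C in a whitespace-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGCGCCTCG ACGAGCACGG"
print(f"{gc_content(first_blocks):.0%}")  # prints 70%
```

Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field of the record.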
 
Protein sequence
MRLDEHGRWV SDDGAYVWDE AAQTWQPSSA APPAGSSASA RFGNHPGGPA FGAGPAGEPG 
SGRVGSEVGA QGGSFGGRPG AAGSSPGAGP GAGPGAVEAA PAPPSWGGAL TDPPRTMGGV
VATPSEPDAR GLAPYGPTGQ RGRWGEPAAD VIAPHGPSER GDLTGPARRA GAADPVDPVA
ASAGPAAAQT TWGDGDSTGE IRRVGGLAAT YAPSAGGAAD GTSRWDDDPD DEPGPYAGSF
DEHDSGWAPS GPISRRGATA RRESTARRGA TARRDEAGGL PARVTAFVQH VRDRPPLLIG
AAVVLVCLGL GVIGFLALGG GGSDSGTAAG PAAAEKGRYS PEVRQAYLSS CLDVSNGNEG
YCTCTLEKLE AGYTQEEYQR FSDNVQSESS QRIVREIYAA CRDKR
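
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares for all 64 codons, and it does not handle table 11's alternative start codons (this gene begins with ATG, so none is needed). The name `translate` and the codon table layout are choices made for this illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw):
    """Translate a whitespace-formatted DNA sequence, dropping a trailing stop."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # A terminal stop codon ("*") is not part of the protein sequence.
    return protein[:-1] if protein.endswith("*") else protein


# First three and last two blocks of the gene sequence from this record.
print(translate("ATGCGCCTCG ACGAGCACGG CCGTTGGGTC"))  # MRLDEHGRWV
print(translate("TGCCGTGACA AGCGATGA"))               # CRDKR
```

The two outputs match the first ten and last five residues of the protein sequence listed above.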